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Ideal pierdut in noapte a unei lumi ce nu mai este , 

Lume ce gandea in basme §i vorbea in poezii, 

O! te vad, te-aud, te cuget, tanara §i dulce veste 
Dintr-un cer cu alte stele , cu-alte raiuri, cu alp zei. 

— Mihai Eminescu, “Venere §i Madona” 

Oh, ideal lost in night-mists of a vanished universe: 

People who would think in legends — all a world who spoke in verse; 

I can see and think and hear you — youthful scout which gently nods 
From a sky with different starlights, other Edens, other gods. 


— Mihai Eminescu, “Venus and Madonna” 
(translated by Andrei Banta§) 



This image is a courtesy of The Times of London. Printed in the February 3, 
2004 issue. 





Preface 


Rosencrantz: Shouldn’t we be doing something... constructive? 
Guildenstern: What did you have in mind? 

— Tom Stoppard, Rosencratz and Guildenstern Are Dead 


We have written this book in order to provide an introduction to constructive anal- 
ysis, emphasising techniques and results that have been obtained in the last twenty 
years. The intended readership comprises senior undergraduates, postgraduates, 
and professional researchers in mathematics and theoretical computer science. We 
hope that our work will help spread the message that doing mathematics construc- 
tively is interesting (it can even be fun!) and challenging, and produces new, deep 
computational information. 

An appreciation of the distinction between constructive and nonconstructive has 
become more widespread in this era of computers. Nevertheless, there are few books 
devoted to the development of mathematics in a rigorously constructive/computable 
fashion, although there are some, primarily concentrating on logic and foundations, 
in which the odd chapter deals with constructive mathematics proper as distinct 
from its underlying logic or set theory. It is now almost forty years since the publi- 
cation of Errett Bishop’s seminal monograph Foundations of Constructive Analysis 
[9], which in our view is one of the most remarkable intellectual documents of the 
twentieth century, and more than twenty since the appearance of its outgrowth 
[12]. In the intervening years there has been considerable activity in constructive 
analysis, algebra, and topology; in related foundational areas such as type theory 
[69]; and in the relation between constructive mathematics and computer science 
(for example, program extraction from proofs [42, 70, 51]). Believing that a new 
introduction to the mathematical, as distinct from the foundational, side of the 
subject is overdue, we embarked upon this monograph. 

Our book is intended not to replace, but to supplement, Bishop’s original classic 
[9] and the later volume [12] based thereon. Both of those two monographs cover 
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aspects of analysis, such as Haar measure and commutative Banach algebras, that 
we do not mention. We cover some topics that are found in [9] and [12] (it would be 
almost inconceivable to produce a book like ours, dealing with constructive mathe- 
matics for nonexperts, without proving, for example, basic results about locatedness 
and total boundedness); but we have tried to provide improved proofs whenever 
possible. However, much of the material we present was simply not around at the 
time of writing of [9] or [12]. 

Instead of systematically developing analysis, beginning with the real line and 
continuing through metric, normed, and Hilbert spaces to its higher reaches, we 
have chosen to write the chapters around certain themes or techniques (hence our 
title). For example, Chapter 3 is devoted to the A-technique, which, since its first use 
in the proof of Lemma 7 on page 177 of [9], has become a surprisingly powerful tool 
with applications in many areas of constructive analysis. A major influence in the 
application of the A-technique was Ishihara’s remarkable paper [60], which showed 
that a subtle use of the technique could enable us to prove disjunctions whose proof, 
although trivial with classical logic, appears at first sight to be constructively out 
of the question. This paper opened up many new pathways in constructive analysis. 

Chapter 1 introduces constructive mathematics and lays the foundations for 
the later chapters. In Chapter 2 we first present a new construction of the real 
numbers, motivated by ideas in [2]. After deriving standard properties such as the 
completeness of K, we introduce metric spaces, with the major theme of locatedness, 
and normed linear spaces. When we discuss metric, normed, and Hilbert spaces, we 
assume some familiarity with the standard classical definitions of those concepts 
and with those elementary classical properties that pass over unchanged to the 
constructive setting. 

Chapter 3 we have already referred to. The main theme of Chapter 4 is finite- 
dimensionality, but the chapter concludes with an introduction to Hilbert spaces. 

Chapter 5 deals with convexity in normed spaces. Starting with some elementary 
convex geometry in R", the chapter goes on to handle separation and Hahn-Banach 
theorems, locally convex spaces, and duality. Following Bishop, we describe those 
linear functionals that are weak* -uniformly continuous on the unit ball of the dual 
space. We then give a new application of the technique used to prove that result, 
thereby characterising certain continuous linear functionals on the space of bounded 
operators on a Hilbert space. 

In Chapter 6 we derive a range of results associated with the theme of located- 
ness and with the A-technique introduced in Chapter 3. We pay particular attention 
to necessary and sufficient conditions for convex subsets of a normed space to be 
located, and to connections between properties of an operator on a Hilbert space 
and those of its adjoint — when that adjoint exists: it may not always do so construc- 
tively. The final section of the book deals with a relatively recent version of Baire’s 
theorem and its applications, and culminates in constructive versions of three of 
the big guns in functional analysis: the open mapping, inverse mapping, and closed 
graph theorems. 
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IX 


Which parts of the book deal with new material, compared with what appeared 
in [12]? We have already mentioned the new construction of the real numbers, in 
Chapter 2. Notable novelties in the later chapters include all but one result in 
Chapter 3 on the A-technique; the section on convexity, Ishihara’s results on exact 
Hahn-Banach extensions, and our characterisation theorem for certain continuous 
linear functionals, all in Chapter 5; and virtually all of Chapter 6. Throughout the 
book there are what we hope will be seen as improvements and simplifications of 
proofs of many results that were given in [9] or [12]. 

What do we mean by “constructive analysis” in the title of this book? We do 
not mean analysis carried out with the usual “classical” logic within a framework, 
such as recursive function theory, designed to capture the concept of computability. 
In our view, such a notion of constructive has at least two drawbacks. First, by 
working within, say, the recursive setting, it can make the mathematics look less 
like normal mathematics and much harder to read. Secondly, the recursive con- 
straint removes the possibility of other interpretations of the mathematics, such as 
Brouwer’s intuitionistic one [48]. Our approach, on the other hand, has neither of 
these features: the mathematics looks and reads just like the mathematics one is 
used to from undergraduate days, and all our proofs and results are valid in several 
models. They are valid in the recursive model, in intuitionistic mathematics, and, we 
believe, in any of the models for “computable mathematics” (including Weihrauch’s 
Type Two Effectivity Theory [91], within which Andrej Bauer has recently found a 
realisability interpretation of constructive mathematics within Weihrauch’s theory 
[5]). They are also valid proofs in standard mathematics with classical logic. For 
example, our proof of the Hahn-Banach theorem (Theorem 5.3.3) is, as it stands, 
a valid algorithmic proof of the classical Hahn-Banach theorem. Moreover — and 
this is one advantage of a constructive proof in general — our proof embodies an 
algorithm for the construction of the functional whose existence is stated in the 
theorem. This algorithm can be extracted from the proof, and, as an undeserved 
bonus, the proof itself demonstrates that the algorithm is correct or, in computer 
science parlance, “meets its specifications”. 1 

So how do we achieve all this? Simply by changing the logic with which we do 
our mathematics! Instead of using classical logic, we systematically use intuition- 
istic logic, which was abstracted by Heyting [52] from the practice of Brouwer’s 
intuitionistic mathematics. The remarkable fact is that every proof carried out with 
intuitionistic logic is fully constructive/algorithmic. (Is this the “secret on the point 
of being blabbed” that appears in the epigraph to Bishop’s book?) Unfortunately, 
too few mathematicians outside the mathematical logic community are aware of 
this serendipity and dismiss both intuitionistic logic and constructive mathematics 
as at best a marginal curiosity. This contrasts sharply with the theoretical com- 
puter science community, in which there is considerable knowledge of, and interest 
in, the computational power of intuitionistic logic. 


x We do not carry out program-extraction from proofs in our book. For more on this 
topic see [42, 51, 70]. 
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Reading constructive mathematics demands careful interpretation. A theorem 
in this book might look like a familiar one from classical analysis, but with more 
complicated hypotheses and proof. However, the statement of the theorem will be 
phrased so that the explicit algorithmic interpretation is left to the reader; and the 
additional hypotheses will be necessary for a constructive proof, which will contain 
algorithmic information that is excluded from the classical proof by the latter’s 
use of principles outside intuitionistic logic. Consider, for example, the following 
statement: 

(*) Let C be an open convex subset of a normed space X, let £ G C, and 
let z € X be bounded away from C. Then the boundary of C intersects the 
segment [£, z] joining f and z. 

This is trivial to prove classically; but to find/construct the (necessarily unique) 
point in which the boundary of C intersects [£, z\ is a totally different matter. The 
constructive theorem (Proposition 5.1.5 below) requires us to postulate that the 
union of C and its metric complement —C (the set of points bounded away from 
C) be dense in X , and that X itself be a complete normed space. The constructive 
proof, though elementary, requires some careful geometrical estimation that would 
be supererogatory in the natural classical proof by contradiction. The benefit of 
that estimation and of the use of intuitionistic logic is that we could extract from 
the constructive proof an implementable algorithm for finding the point where 
the segment crosses the boundary. In turn, this would enable us to produce an 
algorithm for constructing separating hyperplanes and Hahn-Banach extensions of 
linear functionals, under appropriate hypotheses. 

We could have made the algorithmic interpretation of the constructive version 
of (*) explicit by stating the proposition in this way: 

There is a “boundary crossing algorithm” that, applied to the data consisting 
of { i) an open convex set C in a Banach space X such that CU—C is dense 
in X, (ii) a point £ of C , and (iii) a point z of —C, constructs the point 
where the boundary of C intersects the segment [£, z] . 

Even this is not really explicit enough. A full description of the data to which the 
boundary crossing algorithm applies would require explicit information about the 
algorithms for such things as these: membership of (7; the convergence of Cauchy 
sequences in X\ the computation, for given x in X and e > 0, of a point y of CU—C 
such that ||x — y 1 1 < e (and even the decision between the cases “ y G C” and “y G 
— C”); and so on. Such explicit description of algorithmic hypotheses would become 
an ever greater burden on writer and reader alike as the book probed deeper and 
deeper into abstract analysis. It is a matter of sound sense, even sanity, to unburden 
ourselves from the outset, relying on the reader’s native wit in the interpretation 
of the statements of our constructive lemmas, propositions, and theorems. 

We should make it clear that we are not advocating the exclusive use of in- 
tuitionistic logic in mathematics. That logic is, we believe, the natural and right 
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one to use when dealing with the constructive content of mathematics. To abandon 
classical logic in those fields (such as the higher reaches of set theory) where con- 
struct vity is of little or no significance makes no sense whatsoever. Nevertheless, 
it is remarkable how much mathematics actually has what Bishop called “a deep 
underpinning of constructive truth” . 


Christchurch, New Zealand Douglas Bridges 

January 2006 Luminifa Simona Vifa 
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Introduction to Constructive Mathematics 


My task which I am trying to achieve is, by the power of the written word, to 
make you hear, to make you feel -it is, before all, to make you see. That, and no 
more, and it is everything. 

— Joseph Conrad, The Nigger of the ‘Narcissus’ 


In this chapter we first sketch the history and philosophy that motivated the early workers 
in the field of constructive mathematics. We then describe informal intuitionistic logic 
and discuss a number of elementary classical theorems that do not carry over to the 
constructive setting. Finally, we introduce an informal constructive theory of sets and 
functions. All this will prepare us for the presentation of the constructive theory of the 
real line R. in Chapter 2, and for the more abstract analysis that will be described in later 
chapters. 


1.1 What Is Constructive Mathematics? 

Proposition 20 in Book IX of the thirteen volumes of Euclid’s Elements states that 

Prime numbers are more than any assigned multitude of prime numbers 

— in current terms, there are infinitely many primes. The modernised version of 
Euclid’s proof is often presented as follows. Suppose that there are only finitely 
many primes, say p\, ... ,p n , and consider the integer 

p = pi x p 2 x • • • x p n + 1. 

Being greater than 2, p has prime factors (it may even be prime itself). Since the 
numbers pk are not divisors of p, each prime factor of p is distinct from each pk ■ 
This is absurd, since {p \ , . . . ,p n } is supposed to be the set of all primes. From this 
contradiction we conclude that the set of primes is infinite. 
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Although at one level there appears to be nothing untoward about this proof, 
it can be criticised on two counts. First, it uses a totally unnecessary contradiction 
argument. If you look carefully, you will see that the proof actually embodies an 
algorithm that, applied to any finite set {pi,...,p n } of primes, enables you to 
compute a prime that is distinct from each element of that set. In other words, 
the use of a contradiction argument in the preceding paragraph has obscured the 
computational content of Euclid’s proof. 

The second criticism of the proof is a little more subtle, and deals with the 
notion of “infinitely many”. The proof is based on the negative idea that a set is 
infinite if and only if it is contradictory that it be finite. But an algorithmic re- 
casting of Euclid’s proof, as suggested in the preceding paragraph, shows that the 
set S of primes is infinite in a more positive, productive sense: namely, if we start 
with a finite subset F of S, then we can compute an element of S that is distinct 
from each element of F. 

From a traditional standpoint, the distinctions between the contradiction proof 
of Euclid’s theorem and the algorithmic one, and between the negative and positive 
notions of “infinite” , are obscured if not invisible. For example, the two notions of 
“infinite” are equivalent if we use traditional logic — or classical logic, as it is nor- 
mally called — so the distinction has to be perceived at an aesthetic level rather than 
a mathematical one. The same applies, more generally, to a proof by contradiction 
of the existence of an object x with the property P(x). In such a proof one sup- 
poses that P(x) is false for all applicable objects x, deduces a contradiction, and 
then concludes that P(x) must, after all, hold for some x, even though the proof 
doesn’t tell us which x actually has the desired property. Classical logic draws no 
distinction between the “idealistic existence” demonstrated by such a proof and the 
“constructive existence” based on an algorithm that constructs x and shows that 
P{x) holds. In order to reveal such distinctions at a mathematical, rather than an 
aesthetic, level we shall adopt the radical expedient of changing our logic: through- 
out this book, we shall work with intuitionistic logic , an abstraction of the informal 
logic used in algorithmic thinking. 

How much analysis, as normally presented, is really nonconstructive — that is, 
essentially dependent on proofs by contradiction or other nonalgorithmic proce- 
dures? Consider, for example, the classical intermediate value theorem : 

If f : [a, b\ — > R. is a continuous mapping such that /(a) < 0 and f(b) > 0, 

then there exists c £ (a, b) such that f(c) = 0. 

(Note that [a, b } and (a, b) respectively denote the closed and open intervals with 
endpoints a and b. We shall use standard notations, like (a, 6] for the half-open 
interval, without further comment.) 

It might be thought that the common elementary proofs of the intermediate 
value theorem are constructive, enabling one to produce a zero c of the function /. 
For example, one proof uses interval-halving in the following way. Without loss of 
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generality, take a = 0 and 6=1. Consider /(1/2): if it is 0, then we take c = 0 
and stop the process; if /(1/2) > 0, then / satisfies the hypotheses of the theorem 
with a = 0 and b = 1/2; if /(1/2) < 0, then / satisfies the hypotheses with a = 1/2 
and b = 1. In each of the last two cases, we proceed with the interval-halving. This 
process either stops after a finite number of iterations and produces the required 
zero of /, or else it goes on ad infinitum to produce a descending sequence of 
compact intervals whose unique point of intersection is the required zero. Isn’t this 
a fully algorithmic proof? 

Suppose we try to implement the algorithm embodied in this proof on a com- 
puter that works with 50-bit precision. What happens if we apply it to the cubic 
function / defined on [0, 1] by 



Here, 

/( 0 ) = ~~ 2" 51 < 0 , /( 1 ) = ^ - 2- 51 > 0 , 

16 16 

so we are well set to carry out the first step of the interval-halving algorithm. Since 
our computer’s floating-point representation of /( 1/2) is 0 (we have the phenom- 
enon of underflow , in which the computer sets the small but nonzero number — 2~ 51 
equal to 0), the algorithm stops by outputting c = 1/2 as the place where / has a 
zero. But in this case the only zero of / in [0, 1] lies between 3/4 and 1, more than 
one quarter of the entire interval away from the output value 1/2. 

Now, one could object that this example is misleading, in that the problem 
arises from the level of precision in the computer rather than any intrinsic failing 
in the algorithm itself. To deal with this point, for each positive integer n let G(n) 
signify that 2n + 2 is a sum of two primes. Construct a binary sequence (a n ) n>1 
such that for each n, 

a n = 0 if and only if either G(k) for all k ^ n or else there exists k < n 
such that ^G(k). 

OO 

Define a = a n 2~ n , and note that a = 0 if and only if the Goldbach conjecture, 

n— 1 

Every even integer greater than 2 is a sum of two primes, 

holds. Using classical logic, apply the classical interval-halving algorithm to the 
cubic function / defined on [0, 1] by 



As long as the status of the Goldbach conjecture remains undecided (which it has 
done since the conjecture first appeared in 1742), no matter what finite precision 
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our computer has, the algorithm will output 1/2 as a zero of /; but if the Goldbach 
conjecture is false, then /(1/2) = —a < 0 and the first zero of / in [0, 1] occurs 
between 3/4 and 1. In fact, the classical algorithm will give the correct output if 
and only if the Goldbach conjecture is true. However, as the reader may verify, a 
constructive approximate interval-halving argument, such as that expected in the 
solution of Exercise 11, does not give the possibly false value 1/2 for a zero of /. 
It produces a value that approximates the zero of / lying between 3/4 and 1, as 
accurately as the precision of the computer permits. 

In this example, the classically (but not constructively) defined function taking 
the parameter a to the smallest root r(a) of / is discontinuous at a = 0. The 
classical algorithm correctly outputs r(a) = 1/2 in the case a = 0; but if a > 0, 
then, by outputting the value 1/2, the algorithm has failed to spot that the value of 
r(a) jumps from 1/2 to more than 3/4 as the parameter a increases from 0. There 
is a general principle that constructive proofs will involve continuity in parameters. 
Thus we cannot expect to prove constructively that for each a the above cubic 
function / has a smallest zero. 

The problem with the classical interval-halving algorithm is that the finite pre- 
cision of the computer prevents it from making correct comparisons between two 
very close, but distinct, real numbers. If we are to develop mathematics in a compu- 
tational manner, we have to ensure that such comparisons are barred. This barring 
can be done in at least two ways. One way is to use classical logic and to preclude 
nonalgorithmic “decisions” (such as whether two given numbers are equal) by devel- 
oping the mathematics using a standard programming language or a more abstract 
algorithmic framework like that of recursive function theory. Another way is to 
change from classical to intuitionistic logic. The advantages of this second way are, 
first, that nonalgorithmic “decisions” are automatically barred by the logic, and, 
second, that the resulting mathematics looks like the mathematics we are used to 
from school and university, without any special logical notation such as is used in, 
for example, recursive function theory. 

In this book we explore mathematics with intuitionistic logic. We work through- 
out with notions, like that of “infinitely many” discussed earlier, that have positive 
computational meaning; and we present only algorithmic proofs — ones that show 
how we can, at least in principle, construct the objects whose existence is asserted 
in the statement of a theorem. We hope to convince the reader that, contrary to a 
widely held belief, intuitionistic logic suffices for the development of deep, interest- 
ing mathematics and often opens up new vistas that are hidden by classical logic. 
In other words, we want to justify our belief in the power of positive (constructive) 
thinking in mathematics. 
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1.2 A Very Brief History 

Although luminaries such as Leopold Kronecker had advocated a constructive ap- 
proach to mathematics in the nineteenth century, the story of modern construc- 
tivism really begins with the publication, in 1907, of the doctoral thesis “On 
the Foundations of Mathematics” [41], in which the Dutch mathematician L.E.J. 
Brouwer introduced his intuitionistic mathematics (INT) as an alternative to tra- 
ditional classical mathematics (CLASS). According to Brouwer, mathematical ob- 
jects are free creations of the human mind, independent of both logic and language, 
and a mathematical object comes into existence precisely when it is constructed. 
Such a belief naturally leads to a rejection of existence proofs by contradiction, and 
a consequent scepticism about the meaning of many of the theorems of CLASS. Not 
surprisingly, Brouwer’s views met with at best indifference, and at worst hostility, 
from the large majority of his peers, for whom the elimination of nonconstructive 
arguments, with all their apparent power and fruitfulness, was too great a price to 
pay for a clarification of the meaning of mathematics. 

If we adhere to the principle that “existence” should always be interpreted 
constructively, then we are forced to dispense with the unrestricted use of the 
logical law of excluded middle (or excluded third), 

P or (not P), 

which we shall abbreviate to LEM. Recognising this consequence of his philosophical 
views, Brouwer went as far as to claim, 

The belief in the universal validity of the principle of the excluded third in math- 
ematics is considered by the intuitionists as a phenomenon of the history of civ- 
ilization of the same kind as the former belief in the rationality of 7T, or in the 
rotation of the firmament about the earth [44]. 

Subsequently, he introduced into INT some principles that led to results ap- 
parently contradicting aspects of classical mathematics. For example, Brouwer was 
able to prove that any real-valued function on [0, 1] is uniformly continuous. But 
to regard Brouwer’s mathematics as inconsistent with its classical counterpart is a 
serious oversimplification of the situation, since the two types of mathematics are 
in many respects incomparable. Nevertheless, there was, and remains, a commonly 
held belief that too much mathematics has to be given up in order to accommodate 
Brouwer’s ideas. For example, Hilbert expressed his disagreement with Brouwer in 
words both forceful and memorable: 

Forbidding a mathematician to make use of the principle of excluded middle is 
like forbidding an astronomer his telescope or a boxer the use of his fists [54]. 

Despite continuing opposition, intuitionism survived and new constructive ap- 
proaches to mathematics arose. In 1948-1949 in the former Soviet Union, A. A. 
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Markov initiated a programme of recursive constructive mathematics (RUSS ) — 
mathematics using intuitionistic logic and based on the Church-Markov-Turing 
thesis that all computable partial functions from the set N of natural numbers to 
itself are recursive. This approach led to a number of technical successes [66, 67]. 
RUSS does not use any of Brouwer’s nonlogical intuitionistic principles; indeed, it 
could not, since it produces results that are false if interpreted directly within INT. 
For example, in RUSS there exists a continuous real- valued map on [0, 1] that is not 
uniformly continuous; more dramatically, there exists a uniformly continuous map 
/ from [0, 1] onto (0, 1] that has infimum equal to 0. Once again, one should not 
overreact to the apparent conflict with classical mathematics: the last of these re- 
sults should really be interpreted as saying that there exists a recursively uniformly 
continuous recursive function / from the closed interval [0, 1] of the recursive real 
line onto the recursive interval (0, 1] that has infimum equal to 0. Put this way, 
the result does not conflict with CLASS; indeed, it is a result of CLASS, since 
the proof within R.USS is actually a proof within CLASS that does not use such 
nonconstructive logical principles as LEM. 

By the mid-1960s, constructive mathematics was, when compared with its clas- 
sical counterpart, virtually stagnant. The situation changed in 1967 with the pub- 
lication of Errett Bishop’s monograph Foundations of Constructive Mathematics 
[9]. This book and its offspring [12] represent the most far-reaching and systematic 
presentation of constructive analysis to date. In [9], Bishop revealed, by thorough- 
going constructive means but without resorting to either Brouwer’s principles or 
the formalism of recursive function theory, a vast panorama of constructive mathe- 
matics, covering elementary analysis, metric and normed spaces, abstract measure 
and integration, the spectral theory of selfadjoint operators on a Hilbert space, 
Haar measure, duality on locally compact groups, and Banach algebras. Bishop’s 
constructive mathematics ( BISH ) was founded on a primitive, unspecified notion of 
“algorithm”, or “finite routine”, and the Peano properties of natural numbers, and 
kept strictly to the interpretation of “existence ” as “computability” . His refusal to 
pin down the notion of algorithm led to criticism, particularly from philosophers of 
mathematics and from those committed to the Church-Markov-Turing thesis; but 
this very imprecision enabled Bishop’s work to have a variety of interpretations: his 
results are valid in CLASS, INT, R.USS, and all reasonable models of computable 
mathematics, such as the more recent one propounded by Weihrauch [91]. Indeed, 
from a purely formal viewpoint, each of INT, R.USS, and CLASS can be regarded 
as BISH plus some additional principles: INT can be regarded as BISH supple- 
mented by Brouwer’s continuity principle and fan theorem; R.USS as BISH plus 
the Church-Markov-Turing thesis; and CLASS as BISH plus the law of excluded 
middle. 

One consequence of this multiplicity of interpretations is that we can often 
demonstrate that certain propositions P are independent of BISH; that is, neither 
P nor (not P) can be proved within BISH. For example, since “every mapping 
from [0, 1] into R is uniformly continuous” is a theorem of INT, and “there exists 
a continuous map of [0, 1] into R that is not uniformly continuous” is a theorem of 
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RUSS, and since both INT and RUSS are formally consistent with BISH, within 
BISH we cannot expect either to prove that every continuous map of [0, 1] into R. 
is uniformly continuous or to construct an example of a real-valued function that 
is defined, but not uniformly continuous, on [0,1]. 

Over the years since the publication of Bishop’s book, it became clear to a num- 
ber of researchers that, in essence, BISH is simply mathematics with intuitionistic 
logic together with some appropriate set-theoretic foundation. As we pointed out 
at the end of the preceding section, working with intuitionistic logic automatically 
bars noncomputational steps. As long as we keep strictly to intuitionistic logic, 
having made sure that our set-theoretic principles do not inadvertently imply LEM 
or some other nonconstructive proposition, the mathematics we develop turns out 
to be predictive, in the sense that every proof implicitly shows that if we perform 
certain calculations, we shall achieve certain results. Accordingly, when we speak 
of “constructive mathematics” or “BISH” in future, we shall mean “mathematics 
with intuitionistic logic”. It therefore behooves us to explain more clearly exactly 
what intuitionistic logic is. 


1.3 Intuitionistic Logic 


The meaning doesn’t matter if it’s only idle chatter of a transcendental kind. 

— W.S. Gilbert, Patience 

Everywhere one seeks to produce meaning, to make the world signify, to render 
it visible. 


— Jean Baudrillard, Seduction, or the Superficial Abyss 


For Brouwer, mathematics took precedence over logic. In order to describe the logic 
used by the (intuitionist) mathematician, it was necessary first to analyse the math- 
ematical processes of the mind, from which analysis the logic could be extracted. 
In 1930, Brouwer’s most famous pupil, Arend Heyting (1898-1980), published a 
set of formal axioms that so clearly characterise the logic used by the intuitionist 
that they have become universally known as the axioms for intuitionistic logic [52] . 
These axioms capture the so-called BHK interpretation of the connectives 

V (or), A (and), =» (implies), -> (not) 


and quantifiers 


3 (there exists), V (for all/each), 


which we now outline. 
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► P V Q : either we have a proof of P or else we have a proof of Q. 

► P A Q : we have both a proof of P and a proof of Q. 

► P =>■ Q : by means of an algorithm we can convert any proof of P into a proof 
of Q. 

► —iP : assuming P, we can derive a contradiction (such as 0 = 1); equivalently, 
we can prove (P => (0 = 1)) . 

► 3x P(x) : we have (i) an algorithm that computes a certain object x , and (ii) an 
algorithm that, using the information supplied by the application of algorithm 
(i), demonstrates that P(x) holds. 

► \/x £ AP(x) : we have an algorithm that, applied to an object x and a proof 
that x £ A, demonstrates that P(x) holds. 

Note that in the interpretation of the statement Vx £ AP(x), the proof of 
P(x) will normally use both the data describing the object x and the information 
supplied by a proof that x belongs to the set A. This is an important point, since 
upon it hinges a key argument against the use of the axiom of choice in constructive 
mathematics. We shall return to this matter later. 

A property P(x) is said to be decidable if for each x to which it might be 
applicable we have 

P(x) V — 'P(x), 

where the disjunction and negation are given their BHK interpretations. Even for 
a decidable property P(n) of natural numbers n the property 

VnP(n) V ->VnP(n), 

and hence a fortiori LEM, will not hold in general. As a result, many classical 
results cannot be proved constructively, since they would imply LEM or perhaps 
some other manifestly nonconstructive principle. 

To illustrate this point, consider the following simple statement, the limited 
principle of omniscience (LPO): 

Va € {0, 1} N (a = 0Va/0), 

where a = (ai, < 22 , . . .) , N + = {1,2,...} is the set of positive integers, {0, 1} N is 
the set of all binary sequences, and 


a = 0 Vn (a n = 0) , 
a/0 -4=>- 3 n (■ a n = 1) . 

In words, LPO states that for each binary sequence (a„)„^i, either a n = 0 for all 
n or else there exists n such that a n = 1. Of course, this is a triviality from the 
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viewpoint of classical logic. But its BHK interpretation is not so simple: it says 
that there is an algorithm that, applied to any binary sequence a, either verifies 
that all the terms of the sequence are 0 or else computes the index of a term equal 
to 1. Anyone familiar with computers ought to be highly sceptical about such an 
algorithm, since in the case a = 0 it would normally need to test each of the 
infinitely many terms a n in order to come up with the correct decision. 

For such reasons we feel justified in not accepting LPO, or any classical propo- 
sition that constructively implies LPO, as a valid principle of constructive mathe- 
matics. But we have another reason for not doing so: it can be shown that there 
are models of Heyting arithmetic — Peano arithmetic with intuitionistic logic — in 
which LPO is false; so LPO cannot be derived in Heyting arithmetic (see [34, 48]). 
Since LPO is a special case of the law of excluded middle, we are led, in turn, to 
renounce the latter when working constructively. Similar informal analyses lead us 
to exclude both the classical rule 


— P =► P, 

which forms the basis of proof by contradiction, and the following lesser limited 
principle of omniscience ( LLPO ), which is easily seen to be a consequence of LPO: 

For each binary sequence a with at most one term equal to 1 (in the sense 
that a m a n = 0 for all distinct m and n), either a 2 n = 0 for all n or else 
<i 2 n+i = 0 for all n. 


The exclusion of such principles from constructive mathematics has serious con- 
sequences for mathematical practice. For example, we cannot hope to prove con- 
structively the simple statement 

Mx € R (x = 0 V x ^ 0) , (1.1) 

where R. denotes the set of real numbers and a: / 0 means that we can compute 
a rational number strictly between 0 and x (which, as we shall see when we deal 
with the real numbers more formally in Chapter 2, is not the same, constructively, 
as proving that -> {x = 0)). To prove this, consider any binary sequence a, and use 
it to define the binary expansion of a real number 

OO 

X = ^ a n,2~ n - 
n = 1 

If x = 0, then a = 0. If x ^ 0, we can compute a positive integer N such that 

OO 

x > 2 ~ n = J2 2 ~"; 

n=N+l 

it is then clear that, by testing the terms ai, . . . , ajv, we can find n it N such that 
a n = 1. Thus statement (1.1) about real numbers implies LPO and is therefore 
essentially nonconstructive. 
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If the binary sequence a has at most one term equal to 1, then we can use the 
real number 

OO 

£(-!)" o„2-" 

n—1 


to show that the statement 


\/x € R. (x ^ 0 V x < 0) 


implies LLPO. 

The following elementary classical statements also turn out to be nonconstruc- 
tive. 


> Each real number x is either rational or irrational (that is, x ^ r for each 
rational number r). To see this, consider 


* = £ 


i dji 


where a is any increasing binary sequence (that is a binary sequence such that 
a n ^ a n + 1 for each n). 

t> Each real number x has a binary expansion. Note that the standard interval- 
halving argument for “constructing” binary expansions does not work, since 
we cannot necessarily decide, for a given number x between 0 and 1, whether 
x^l/2ora;^l/2. In fact, the existence of binary expansions is equivalent to 
LLPO. 


t> The intermediate value theorem, which is equivalent to LLPO. 

> For all ijel, if xy = 0, then either x = 0 or y = 0. The constructive failure 
of this proposition clearly has implications for the theory of integral domains. 


We emphasise here that classically valid statements like “each real number is either 
rational or irrational” that imply omniscience principles are not false in constructive 
mathematics; they cannot be, since BISH is consistent with CLASS. 


One principle whose constructive status is controversial is Markov’s principle 
(MP): 

Va e {0, 1} N+ (n( a = 0)=ta/0); 


in words, for any binary sequence a, if it is impossible for all the terms to equal 
0, then there exists a term equal to 1. In order to accept this as a principle of 
constructive mathematics, you have to be convinced that the information conveyed 
by the antecedent -> (a = 0) is sufficient to enable us to compute an index n with 
a n = 1. The argument in favour of MP says that we can carry out this computation 
by searching systematically through the terms a n , since the hypothesis -> (a = 0) 
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guarantees that we shall eventually stumble across a term equal to 1. The counter- 
argument is that the antecedent provides us with no prior bound for such a search — 
it does not tell us how many terms we need to test before we arrive at one equal to 
1 — so the search might go on longer than the remaining life of the universe before 
it produced the desired result. Moreover, Markov’s principle, like LPO, cannot be 
proved within Heyting arithmetic. For these reasons, we shall follow the normal 
practice of excluding MP from the working principles of constructive mathematics. 
As a consequence we exclude the even stronger logical principle 

(Vcc € A{P{x) V ~^P{x)) A -iVx € A->P(x)) => 3x e AP{ x), (1.2) 

where A is a well-defined set (the exact meaning of “well-defined set” will become 
clear in the next section). In fact, even if we were to accept Markov’s principle on 
the grounds that an unbounded search through the natural numbers that cannot 
fail to terminate must eventually do so, we would balk at accepting (1.2), since for 
a general set A there will be no natural order allowing us to search systematically 
in the way we can with N. 

An example of the type dealt with earlier, in which a classically valid proposition 
P is shown constructively to entail an essentially nonconstructive principle like 
LEM, LPO, LLPO, or even MP, is called a Brouwerian counterexample to P (even 
though it is not a counterexample in the true sense of the word; it is merely an 
indication that P does not admit of constructive proof). There is another expression 

OO 

that we may use in this context. For example, we refer to the number x = a n 2~ n 

n= 1 

that we constructed from a given binary sequence (a n ) n>1 and then used to show 
that (1.1) implies LPO as a Brouwerian example of a real number x for which we 
cannot decide whether x = 0 or x ^ 0. 


1.4 Informal Set Theory 

Die ganzen Zahlen hat der liebe Gott gemacht, alles andere ist Menschenwerk. 

— L. Kronecker [90] 

The primary concern of mathematics is number, and this means the positive 
integers. We feel about number the way Kant felt about space. The positive 
integers and their arithmetic are presupposed by the very nature of our intelligence 
and, we are tempted to believe, by the very nature of intelligence in general. The 
development of the positive integers from the primitive concept of the unit, the 
concept of adjoining a unit, and the process of mathematical induction carries 
complete conviction. In the words of Kronecker, the positive integers were created 
by God. 

— Errett Bishop [9] 

Building on the set of positive integers and using intuitionistic logic, we follow 
Bishop’s approach to developing constructive mathematics at higher and higher 
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levels of abstraction. To do this, we need to clarify notions such as “set” and “func- 
tion”. 

For us, a set X (other than our basic, primary set N + of positive integers) is 
given by two pieces of data: 

> a property that enables members of X to be constructed using objects that have 
already been constructed (note this last phrase, which rules out the possibility 
of impredicative definitions and therefore of Russell- type paradoxes), and 

> an equivalence relation =x of equality between members of X. 

We write x £ A to signify that x is an element of the set A, and x (j A instead of 
(x € A) . 

The use of equivalence relations rather than intensional equality — that is, iden- 
tity of description — is common, but often goes unnoticed, in classical mathematics. 
For example, we call the rational numbers 1/2 and 3/6 equal, even though, strictly 
speaking, they are equivalent and not intensionally identical. 

A subset S' of a set X consists of a collection of elements drawn from X, together 
with the equality relation induced on S by the given equality on X\ that is, for 
elements x, y of S, we define 


x=s V < = => x = x y- 

We write ScTto signify that S is a subset of T. If P(x) is a property applicable 
to certain elements a: of a set A, then we denote by 

{x : x € A A P(x)} 


or 

jiei: P(a;)} 

the subset of A consisting of those elements a; of A with the property P(x ) . 

The logical complement of a subset S of X is 

->S = {x G A : x </ S} . 

A particular example of this is the empty subset of X, defined by 

0x = — X. 

We say that a subset S' of A is inhabited if 

3x (x £ S) . 

We then write S ^ 0x ■ Note that in order to show that S is inhabited, we cannot 
just prove that it is impossible for S to be empty; we must actually construct an 
element of S; see Exercise 2. 
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Two sets X , Y are said to be equal sets if each is a subset of the other; in other 
words, if the sets have the same elements and the same equality relation. 

We need to be careful when constructing new sets from old. Since an equality is 
part of the data for a set, it does not make sense to talk of the union S U T of two 
sets unless we can put together not only the sets as collections of objects but also, 
in some way, their given equality relations. In practice, this means that in order to 
construct their union SUT, the sets S and T must be given as subsets of some set 
X. We then define 

SUT = {x G X : x G SV x GT}, 

where the equality on S U T is that induced by X. Likewise, the intersection of S 
and T is defined only when S and T are subsets of some set X, and is then the 
subset 

S HT = {x G X : x G S A x G T} 


of X. 


The (Cart.esian) product of two sets X,Y is the set X x Y consisting of all 
ordered pairs (x, y) with x G X and y GY, together with the equality given by 


((x, y) —x xY (x', y’)) (x = x x' Ay = Y y') ■ 


In many situations — even, as we shall see, on the real line — we frequently need 
a set X to be equipped with an inequality relation describing what it means 
for two elements of X to be unequal , or distinct. Such a relation must satisfy the 
following two properties: 


x^x y =4- -> (x = x y ) , 
x^xy =4- y ± x x. 

If, in addition, 

(x ^x y) => x = x y , 

we say that the inequality is tight. A set X with an inequality is discrete if, for any 
two elements x and y of X , either x =x y or x ^x y\ we then also describe the 
inequality itself as discrete. 

One inequality relation, the denial inequality, is defined by setting x ^x y if 
and only if -> (x =x V ) • This inequality is normally too weak for practical purposes. 
For example, in the absence of Markov’s principle, on the real line R the property 
-> (x =r 0) is weaker than |ar| > 0 (Exercise 3); for that reason we define the standard 
inequality on R to be not the denial inequality but the one given by 

x^r y \x-y\> 0. 

From now on, when the meaning is clear from the context, we write 

=, 0 , ••• 
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rather than 

=X, &X, 

A subset S' of a set X with an inequality has a complement, defined by 
'vS={teI:Vse5(a;/s)}. 

Then ~S C ^S; but unless the inequality on X is the denial inequality, the reverse 
inclusion will not hold. 

A subset S of a set X is said to be detachable (from, or in, X) if 

\/x £ X (x £ S V x S) . 

Since statement (1.1) implies LPO, not even the singleton subset {0} is detachable 
from R. However, {0} is detachable in the set Q of rational numbers. 

When X comes with an inequality relation, we define the inequality on any 
subset S of X to be the one induced by that on X: 

x^sy x y- 

If also Y has an inequality relation, we define the inequality on the Cartesian 
product X x Y by 

(0, V) /xxy (x', y')) {x x'Wy^v y') ■ 

It should be no surprise that we require functions to be given by algorithms and 
to respect equality. Thus a function f from a set X to a set Y — also called a map 
or mapping of X into Y , and written f : X — > Y — is an algorithm that, applied 
to any element x of X, produces an element f(x) of Y such that / is extensional: 

\/x € X\/x' € X (x =X x' => f(x) =Y f(x')) ■ 

The element f(x) is called the value of f at x or the image of x under f. If X 
and Y have inequality relations, then we may require / to be strongly extensional : 

Vx £ X\/x' £ X (f(x) ^Y fix') => x j^x x') ■ 

Note that the statement “all functions from R to R are strongly extensional” is 
equivalent to Markov’s principle. 

Let /, g be two real- valued functions on a set X with an inequality. We say that 
an element xo of X is the strongly unique element of X such that f(x) = g(x) if 
f(x o) = g(x o) and f{x) ^ g{x) whenever x € X and x ^ Xq- 

Strong uniqueness will resurface in Chapter 4 in connection with best approxima- 
tions. 

A partial function f : X — > Y is a function from a subset of X into Y. The 
subset 
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{x £ X : f(x) is defined} 

is called the domain of /, denoted by dom(/) ; and the set 

{y G Y : 3 x£ X{y= Y f(x))} 

the range of /, denoted by ran (/) . The image of a subset A of X under f is the 
set 

f{A) = {/ 0) : x £ A}. 

The inverse image of a subset B of Y under / is the set 

= {x £ dom (/) : f(x) £ B} . 

A partial function / : X — > Y is said to be a total partial function on X if 
dom(/) = X. 

When the expression describing f(x) is given explicitly and the domain of the 
partial function / is clearly understood, we may denote the function by x f(x). 
For example, the partial function from R. to itself whose value is defined, for each 
real number x such that x y^R 0, to be 1/x may be written x 1/x. 

An important type of total partial function is defined as follows. Let X and / 
be sets. A family of elements of X with index set I (or indexed by I) is a mapping 
i Xi of I into X; we commonly denote this family by ( xf) ieI . In particular, if / 
is N + , the family is called a sequence in X and is usually written (x n ) n>1 . More 
general sequences of the form (x n ) n>N have the obvious analogous meaning when 
N is an integer. 

Let / and g be mappings from subsets of a set X into a set Y. where Y is 
equipped with a binary operation O. We introduce the corresponding pointwise 
operation O on f and g by setting 

(fOg){x) = f(x)Og(x) 

whenever f(x) and g(x) are both defined. Thus, taking Y = R, we see that the 
(pointwise) sum of / and g is given by 

(f + g)(x) = f{x) + g(x) 

if /( x) and g{ x) are both defined; and that the (pointwise) quotient of / and g is 
given by 

(// g)(x) = f(x)/g(x) 

if f(x) and g(x) are defined and g(x) ^ 0. If X = N + , so that / = (x n ) n ^i and 
g = (yn)n^i are sequences, then we also speak of termwise operations', for example, 
the termwise product of / and g is the sequence {x n y n ) n ^ 

Pointwise operations extend in the obvious ways to finitely many functions. In 
the case of a sequence (/ n ) n > l of functions with values in a normed space (see 
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Chapter 2), once we have introduced the notion of a series in a normed space we 

OO 

shall interpret ^ fn hr the obvious pointwise way. 

71=1 

A partial function / : X — > Y can be identified with its graph, 

G(f) = {{x,y) GX xY :xG dom (/) , y G ran (/) , y = f(x)}, 

a subset of the Cartesian product XxY. We define two partial functions / : X — > Y 
and g : X — > Y to be equal if their graphs are equal as sets. Thus f, g are equal 
partial functions if and only if their domains are equal subsets of X and f(x) = g(x) 
for each x G dom(/). 

We say that a partial function / : X — > Y is 

► one-one if f(x) = f(x') entails x = x'; 

► injective if X and Y are equipped with inequality relations, and x x’ entails 
f{x) ± 

If / is injective and the inequality on X is tight, then / is one-one: for in that case, 
if f(x) = f(x'), then ~<(x ^ x') and so, by tightness, x = x’ . 

The composition, or composite, of partial functions / : A — > B and g : B — > C 
is the partial function go f : A — > C (sometimes written gf) defined by go f(x) = 
g(f(x)) wherever the right side exists. 

A partial mapping / : X — > Y maps its domain onto Y if 
My G Y3x G X(y = /( x)) . 

On the other hand, we say that f : X — > Y is an epimorphism if there exists a 
mapping g : Y — > X such that 

My GY (/ (g(y)) = y) . 

A one-one partial function / : X — > Y has a one-one inverse 

f _1 : ran (/) — ■> dom(/) 

defined by / _1 (/(x)) = x. If / is injective, then its inverse is strongly extensional. 
A bijection between X and Y is a one-one mapping from X onto Y. 

The subtle distinction between mappings onto and epimorphisms is closely 
linked to the constructive status of the axiom of choice, which we shall discuss 
shortly. First, though, we introduce some notions of cardinality. 

Let S' be a subset of a set X with an inequality relation. We say that S is 


> finitely enumerable if there exist a natural number N and a mapping of the set 
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{1,2,..., N} = {n£ N + : n < N} 


onto S; 

t> finite if there exist a natural number N and an injective map of {1,2,..., N} 

onto S. 

When we speak of finitely many objects, we mean that those objects constitute an 
inhabited, finitely enumerable, but not necessarily finite, set. 

Note that the case TV = 0 of the definition of “finitely enumerable” shows 
that the empty subset of X is finitely enumerable. Clearly, finite implies finitely 
enumerable; the converse does not hold constructively. 

We say that S is countable if there exist a detachable subset D of N + and a 
mapping <f> of D onto S. Every finitely enumerable subset — in particular, the empty 
subset — of X is countable. If D = N + , we call <f> an enumeration of S, in which 
case we may denote S by {</>(l), </>(2), . . .} . If also <f> is one-one, we say that S 
is countably infinite. A set is countably infinite if and only if it is the range of a 
one-one mapping whose domain is a countably infinite, detachable subset of N + . 

We now consider the axiom of choice (AC): 

If X, Y are inhabited sets, S is a subset of X x Y, and for each x £ X 
there exists y £ Y such that ( x , y) £ S, then there exists a choice function 
/ : X — > Y such that (x, f(x)) £ S for each x £ X. 

Under the BHK interpretation, the hypothesis 

\/x £ X 3y £ Y ((x, y) £ S) 

of AC means that we have an algorithm that, applied to each element x oi X 
and the data showing that x belongs to X , constructs an element y of Y and 
demonstrates that (x,y) £ S. This much is clear. However, there is no guarantee 
that the algorithm will respect the equality relation on X — in other words, that 
if x =x x' , and the algorithm constructs y,y' in Y such that (x,y) £ S and 
(x',y') £ S, then y =y y' ■ Indeed, we should expect that the computation of y 
might use data that are associated with properties intrinsic to x that do not apply 
intrinsically to x' . 

For example, anticipating our development of the real number set R, consider 
the case in which X is R and Y is N + . A real number is (defined as) a certain set of 
rational approximations. However, two equal real numbers x,x' can have different 
defining sets of rational approximations. In that case, the algorithm that computes a 
positive integer n such that (x, n ) £ S may, and in general will, compute a different 
positive integer n' such that (x',n') £ S. These considerations throw real doubt 
over the possibility that there is a choice function implementing the algorithm. 
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In fact, an argument of Diaconescu [46] and Goodman & Myhill [50], but prefig- 
ured by Bishop (see Problem 2 on page 58 of [9]), shows that AC cannot be allowed 
as a principle of constructive mathematics. 

Theorem 1.4.1. The axiom of choice implies the law of excluded middle. 

Proof. Let P be any constructively meaningful statement, and define the set X to 
consist of the two elements 0 and 1, together with the equality relation such that 

(0 =v 1) «=► P. 

Let Y be the set {0,1} with the standard equality, and let S be the subset 
{(0, 0), (1, 1)} of the Cartesian product X x Y, taken with the standard equality. 
Suppose there exists a function / : X — > Y such that (x, f(x)) G S for all x € X. 
There are three cases to consider: (i) /( 0) = 1, (ii) /( 1) = 0, and (iii) both /( 0) = 0 
and /( 1) = 1. In case (i) we have (0, 1) = (0, /( 0)) € S, so either (0, 1) =xxy (0, 0) 
or (0,1) =xxY (1,1). If the first of these two alternatives holds, then, by defini- 
tion of the equality on X x Y, we have 1 =y 0, which is absurd. Hence, in fact, 
(0,1) =xxY (1,1) ■ Thus, again by definition of the equality on X x Y, we have 
0 =x 1 and therefore P holds. Case (ii) similarly leads to the conclusion that P 
holds. Finally, in case (iii) we have -i (/( 0) =y /(l)) ; therefore, since / is a func- 
tion, — ■ (0 =x 1) and so -*P holds. Thus we have derived P V -*P from AC. □ 


The axiom of choice will hold constructively if the set X is one for which no 
computation is necessary to demonstrate that an element belongs to it; Bishop calls 
such sets basic sets. Following the practice of most constructive mathematicians, 
we consider N + , the set 

N = {0, 1, 2, 3, . . .} 

of natural numbers, and the set 

Z = {0, ±1, ±2, . . .} 

of all integers to be basic sets. This practice is reflected in our acceptance of the 
principle of countable choice: 

If Y is an inhabited set, S is a subset of N + x Y, and for each positive 
integer n there exists y £ Y such that (n, y) G S, then there is a function 
f : N + — > Y such that ( n , /(n)) € S for each n G N + . 

In fact, many constructive proofs use the stronger principle of dependent choice: 

If X is a set, a £ X, S is a subset of X x X, and for each x € X there 
exists y € X such that (x,y) G S, then there exists a sequence {,x n ) n>1 in 
X such that x\= a and (x n ,x n +i) G S for each n G N + . 
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Another contentious matter in constructive mathematics is the status and role 
of the power set V (X) of a given set X : that is, the collection of all subsets of X, 
with equality of subsets as defined earlier. The main objection to admitting V (A) 
into the constructive fold is that we thereby allow impredicativity, since there is 
then nothing to stop us constructing subsets of X whose defining characteristics are 
self-referential. On the other hand, nobody has yet shown that adding the power 
set axiom 

VA3FVS (ScX^SgY) 

to constructive mathematics enables us to prove LEM or some other incontestably 
nonconstructive principle. 

There are ways of avoiding the power set. It often suffices to work with the 
set Y x of all mappings from X into a set Y. Since we have a clear idea of what 
mappings from X into Y are (something we do not have for subsets of A), the set 
Y x seems relatively innocent. Note that classically the set {0, 1} A can be identified 
with the power set of A, since it comprises the characteristic functions of subsets of 
A. This identification is not possible constructively, since characteristic functions 
exist only for those subsets of A that are detachable. 

Another way to avoid the full generality of the power set is to work with a well- 
defined but smaller set of subsets of A. For example, the set of compact subsets of 
a metric space A is well defined (it is actually a metric space itself), and is often 
all we need for many parts of analysis. 

Let S (A) be a well-defined set of subsets of A, with two elements taken as equal 
if and only if they are equal sets in the usual sense, and let I be some set. Then we 
can speak sensibly about a family of elements of S (A) . We can also define 

the union and intersection of such a family to be, respectively, the subsets 

[jSi = {x G A : 3i G I(x G Si)} 
iei 

and 

p| St = {s G A : Vi G / (a: G $)} 
iei 

of X. If I is the set of positive integers, we denote the above union and intersection 
by U Sn and fj S n , respectively. 

n^l n^l 

The Cartesian product S) is the subset of X 1 consisting of those functions / 
iei 

such that f(i) £ Si for each i £ I; if A comes with an inequality relation, then the 

corresponding inequality on Si is defined by 

iei 

f^g^3iel(f(i)^g(l)). 

If I is the set of positive integers, then (,S))i e / is a sequence of sets, and the elements 
of J~[ Si are also sequences; in this case, the sets Si can be arbitrary and need not 
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be subsets of a previously defined set X. If I = {1, 2, . . . , n}, then we denote the 
product n^bySiX^x-.-x S n , and refer to its elements {x\,x- 2 , . . . , x n ) as 

iei 

ordered n-tuples, or, in the case n = 2, ordered pairs (something we used informally 
in our earlier definition of the Cartesian product of two sets). 

Other set-theoretic notions will be introduced later as they arise. It is now time 
to turn our attention away from foundational matters to analysis proper. 


Exercises 


Although we shall not formally define the real number line R until Chapter 2, in 
these exercises we assume elementary properties of real numbers where necessary. 


1. Justify informally Brouwer’s observation that ^^^P implies ->P. Using this, 
show that the proposition (->^P => P) is equivalent to the law of excluded 
middle. 

2. Prove that if the statement 

—i(S = 0) => 5^0 

applies to all subsets S of {0} , then the law of excluded middle holds. 

3. Prove that 

\/x £ R (-> (x = 0) => x ^ 0) 
is equivalent to Markov’s principle. 

4. Fill in the details of the proof that the statement 

\/x € R (x ^ 0 V x < 0) 

implies LLPO. 

5. Give a Brouwerian counterexample to the proposition that every real number 
x is either rational or irrational (where u x is irrational” means that x ^ r for 
each rational number r). 

6. Prove that every real number has a binary expansion if and only if LLPO holds. 

7. Give a Brouwerian counterexample to the statement that if ?’i,r2, and r 3 are 
real roots of a quadratic polynomial x 2 + ax + b with a, b £ R, then there exist 
distinct i. j with r* = rj. 
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8. Prove that the statement “for all real numbers x, y that have binary expansions, 
the sum x + y has a binary expansion” is equivalent to LLPO. 

9. Prove that the statement 

\/x, y € R. ( xy = 0 => (x = 0 V y = 0)) 
is equivalent to LLPO. 

10. Prove that the intermediate value theorem is equivalent to LLPO. 

11. Let / : [a, b] — ■> R be sequentially continuous in the following sense: for each 
sequence (x n ) n>1 in [a, b] that converges to a limit x, the sequence (f(x n )) n>1 
converges to f(x). Suppose also that f(a)f(b) < 0, and that / is locally nonzero 
in the sense that for each x € [a, b] and each r > 0 there exists y € [a, 6] 
with \x — y\ < r and f(y) ^ 0. Prove that there exists c € (a, b) such that 
/(c) = 0. (This version of the intermediate value theorem suffices for virtually 
all constructive purposes.) 

12. Prove that the statement “all functions from R to R are strongly extensional” 
is equivalent to Markov’s principle. 

13. Discuss the statement “every mapping from a set X onto a set Y is an epimor- 
phism” . 

14. Give a Brouwerian counterexample to the statement “every subset of a finite 
set is finitely enumerable”. 

15. Prove that an inhabited set is countably infinite if and only if it is the range of 
a function with domain N. 

16. Prove that a subset S of N is countable if and only if there exists a sequence 
(S' n ) ra >i of finite subsets of N such that Si C S 2 C S3 C • • • . 

17. Prove that the statement “every inhabited subset of N is countable” implies 
the principle of finite possibility (PFP): to each binary sequence (a n ) n>1 there 
corresponds a binary sequence (b n ) n>1 such that a n = 0 for each n if and only 
if there exists N such that 6 at = 1. 

18. Prove that the intersection of two countable sets is countable. 

19. Prove that the principle of dependent choice implies the principle of countable 
choice. Prove also that the principle of dependent choice can be derived from 
the following principle of internal choice: if X is an inhabited set, S is a subset 
of X x X, and for each x € X there exists y € X such that (x,y) € S, then 
there exists a choice function / : X — > X such that (x, f{x)) € S for each 
x € X. 
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20. By a filter on a set X we mean a set $ of inhabited subsets of X with the 
following properties: 

• If S' and T belong to then S fl T £ $. 

• If S £ $ and S C T C X, then T £ 

A filter il is called an ultrafilter if for all S C X, either S £ il or ->S £ il. The 
classical ultrafilter principle states that every filter is contained in an ultrafil- 
ter. Prove that this principle implies the weak limited principle of omniscience 
(WLPO): 

Va £ 2 n (Vn ( a n = 0) V ~>\ In (a n = 0)) . 


Notes 

The use of the Golclbach conjecture in the example in Section 1.1 is in no sense a 
constraint on the argument; if the Goldbach conjecture were solved tomorrow, we 
could easily replace it by any one of many open problems of number theory. Indeed, 
until Wiles proved the Fermat conjecture in 1994, it was common to use that, rather 
than the Goldbach conjecture, in such Brouwerian examples of nonconstructivity. 

The reader interested in the history of foundations of mathematics should con- 
sult [44] and [84] for more information on the “Grundlagenstreit” that eventuated 
between Brouwer and Hilbert in the 1920s. For more on Hilbert’s programme for 
the secure foundation of mathematics see [93] . 

The designation “BHK interpretation” comes from Brouwer, Heyting, and 
Kolmogorov. The BHK interpretation of implication, while more natural than the 
classical one of material implication, in which (P => Q) is equivalent to (->P V Q), 
has not completely satisfied all researchers using constructive logic. Shortly be- 
fore he died, Bishop communicated to Bridges his dissatisfaction with the standard 
constructive interpretation of implication. Unfortunately, he left nothing more than 
very rudimentary sketches of his ideas for its improvement. (Note, however, Bishop’s 
paper [10].) For a deeper analysis of the constructive interpretations of the connec- 
tives and quantifiers, we refer the reader to [48]. 

The classical invalidity of the recursive interpretation of LPO is not a matter 
of logic: it can be demonstrated, even with classical logic, that a recursive version 
of LPO would lead to a proof of the decidability of the halting problem, which is 
known to be impossible; see [34], pages 52-53. 

Anclrej Bauer has recently shown that proofs and results in BISH can be trans- 
lated into Weihrauch’s Type-2 Effectivity framework by a realisability interpreta- 
tion [5]. 
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The principle (->-iP => P ) is equivalent to LEM even with intuitionistic logic; 
see Exercise 1. 

For the confirmed intuitionist there is at least one other reason for rejecting MP: 
it contradicts Brouwer’s (admittedly controversial) theory of the creating subject, 
an add-on to his intuitionistic mathematics. See [34] (pages 116-117) and, for a 
rather different view, [73]. 

Bishop used “subfinite” instead of “finitely enumerable” . Note that a subset of 
a finite set need not be finitely enumerable. 

In the proof of Theorem 1.4.1 we could have defined X as a set of equivalence 
classes under the equivalence relation ~ defined by 

(0 ~ 1) 4=> P, 

but it is more in keeping with Bishop’s approach to proceed by considering a special, 
if unusual, equality relation on X. 

Some constructive mathematicians argue against even the principle of countable 
choice; see, for example, [76, 82], 

Myhill has shown that, under the Church-Mar kov-Turing thesis, the power set 
of a singleton is uncountable: in other words, there is no recursive mapping of N 
onto that power set [74] (page 364, Theorem 3). Within BISH we may not be able to 
prove the uncountability of the power set of {0} , but we certainly have an unending 
supply of subsets S of {0} for which we cannot decide that S = 0 or S = {0}: given 
any constructively meaningful statement P, we can define a corresponding set 

S = {a: : x = 0 A P} 

such that (S = {0} V S = 0) if and only if P V ->P. 

Myhill’s formal theory— based on primitive notions of “set”, “function”, and 
“natural number” — is but one of several foundational theories advocated for con- 
structive mathematics. Others include Aczel’s constructive set theory [3], Martin- 
Lof’s type theory [69], and a largely unpublished constructive version of Morse’s 
set theory [16] in which membership of the universe appears to correspond to being 
constructively defined. For more on constructive foundational theories see [6] and 
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My deplorable mania for analysis exhausts me. 

— Gustave Flaubert, Letter (August 1846) 

We begin by using a form of interval arithmetic as a foundation for the construction of 
the real number line R. Having discussed the elementary algebraic and order-theoretic 
properties of real numbers, we prove that R. is complete in two senses: Dedekind (order) 
complete and Cauchy (sequentially) complete. The next step is to generalise from the 
reals to metric and normed spaces. A particularly important property for us is total 
boundedness, which plays a big part in proving the existence of suprema and infima; for 
that reason we need to prove that there are lots of totally bounded subsets in a totally 
bounded space. We also highlight the important property of locatedness for subsets of a 
metric space, a property that holds automatically in classical mathematics. 


2.1 The Real Number Line 


Constructive analysis proper, as distinct from arithmetic, begins with the real num- 
bers, which we shall construct, choice-free, by interval arithmetic. First, though, we 
observe that the purely algebraic constructions of the sets Z of integers and Q of 
rational numbers from N are carried out as in classical algebra, and that the stan- 
dard inequality on Q — and hence a fortiori on Z regarded as a subset of Q — is the 
(in this case discrete) denial inequality. 

By a real number we mean a subset x of Q x Q such that 
> for all ( q , q') in x, q < q'; 

t> for all (g, q') and (r, r') in x, the closed intervals [q, q'] and [r, r'] in Q intersect 

in points of Q; 
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> for each positive rational £ there exists ( g , g') in x such that q' — q < e. 

The last of these properties ensures that x is inhabited. It also legitimises our use 
of expressions like “pick an element of x”: to carry out such a selection, we simply 
take £ = 1 in the third of the defining properties of the real number x, to produce 
a corresponding element ( q , q') of x. 

The intuition underlying our definition of “real number” is that the elements 
of x are the rational endpoints of closed intervals with one point — namely x —in 
common. Any rational number q gives rise to a canonical real number 

q = 

with which the original rational q is identified. 

Two real numbers x and y are 

• equal, written x = y, if for all (g, q') € x and all (r, r') G y, the intervals [<7, <7'] 
and [r, r'] in Q have a rational point in common; 

• unequal (or distinct), written x 7^ y, if there exist ( q,q ' ) € x and (r, r') € y 
such that the intervals [q, q'\ and [ r,r'\ in Q are disjoint. 

It is almost immediate that 7^ satisfies the defining properties of an inequality 
relation. Let us check that equality is an equivalence relation. It is trivial that it 
is reflexive and symmetric, so only transitivity has to be handled. Let x = y and 
y = z, and suppose that for some ( q,q' ) € x and (r, r') G z there are no rational 
points in [g, q') fl [r, r'] . We may assume without loss of generality that q' < r. Using 
the third of the defining properties for a real number, choose (s, s') G y such that 
s' — s < r — q 1 . Now, the rational interval [s,s / ] intersects [q,q'] in a rational point 
u, and [r, r'] in a rational point v, so 

r — q 1 ^ v — u ^ s' — s < r — q' , 


a contradiction. Hence 

^{[q,^] n [r, r'\ = 0) . 

Since we are working with intervals in Q, with the aid of two simple lemmas we 
can turn this around to construct a point of [g, q'] fl [r, r '\ , and therefore complete 
the proof that x = z, as follows. 

Lemma 2 . 1 . 1 . Let a,b,c,d be rational numbers with a ^ b and c ^ d. Then there 

exists a rational number in [a, b\ ft [c, d] if and only if a ^ d and c ^ b. 

Proof. If r is a rational number in the intersection of the intervals, then a ^ r ^ d 
and c^r^b, so a^d and c ^ b. If, conversely, these conditions hold, then either 

c < a and therefore a G [a, b ] D [c, d ] , or else a < c and c G [a, 6] fl [c, d] . □ 
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Lemma 2.1.2. Let I , J be closed, bounded intervals in Q such that -> (/ fl J = 0) . 
TTien i/iere exists r £ Q such that r £ I fl J. 

Proof. Let I = [a, b] and J = [c,d] .lib < c, then /fl J = 0, a contradiction. Hence 
c ^ b. Likewise, a ^ d. It remains to apply Lemma 2.1.1. □ 


Taken with the equality and inequality we have defined above, the collection of 
real numbers forms a set: the real line R. 

Let x, y be real numbers. We say that x is greater than y, and that y is less than 

x, if there exist ( q , q') £ x and (r, r') £ y such that r' < q\ we then write x > y or, 
equivalently, y < x. On the other hand, we say that x is greater than or equal to 

y, and that y is less than or equal to x, if for all (q, q') € x and all (r, r') € y we 
have q' ^ r; we then write x ^ y or, equivalently, y ^ x. We write, for example, 
x y to indicate that -> (x > y ) , and x ^ y to indicate that -> (x ^ y) . 

Clearly, x ^ x and x x. Moreover, by Lemma 2.1.1, x = y if and only if both 
x ^ y and y ^ x; and x y if and only if either x > y or else x < y. 

Lemma 2.1.3. // x > y, then y x. 

Proof. Since x > y, there exist {q, q') £ x and (r, r') £ y such that r' < q. If 
also y > x, then there exist (s, s') £ x and ( t , t') £ y such that s' < t. By the 
defining conditions on real numbers, there exist rational numbers a , b such that 
a £ [q, q’] fl [s, s'] and b £ [r, r '] fl [t, t '] . But then 

a ^ s' <t^b^r'<q^a, 


a contradiction. 


□ 


Lemma 2.1.4. x)yi/ and only if y / x. 

Proof. By definition, x ^ y if and only if for all ( q , q') £ x and all (r, r') £ y we 
have q' ^ r, which occurs precisely when ->(y > x). □ 

Lemma 2.1.5. If x > y, then x ^ y. 

Proof. By Lemma 2.1.3, y x. The result follows from Lemma 2.1.4. □ 


Lemma 2.1.6. //x > 0, then there exists a positive integer n such that, x > 1/n. 
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Proof. First pick ( q , q') £ x such that 0 < q. Then choose a positive integer n such 
that q > 1 jn. The definition of “greater than” ensures that x > 1 jn. □ 


From time to time we shall revisit some of our earlier Brouwerian examples in 
the light of our formal definitions of real numbers and their properties. 


Proposition 2.1.7. The statement 

Vx,y £ K(-i(x > y)=> y > x) (2.1) 

implies Markov’s principle. 

Proof. Assume (2.1). Let {a n ) n>1 be an increasing binary sequence such that 

->Vn (a n = 0) , 

and define a real number by 

X = { (°’ «) : °" = °} U { t n) : = 1 “ an ” 1 } ' (2 ’ 2) 

Then -i (0 ^ x): for if 0 ^ q for all (q, q') £ x, then a n = 0 for all n, a contradiction. 
It follows from (2.1) that x > 0 and hence that there exists ( q,q ') £ x such that 
q > 0. Then (q,q r ) = ^) for a (unique) n such that a n = 1 — a n - 1 - □ 


Proposition 2.1.8. The relation > is cotransitive.' If a < b, then for all x £ R. 
either a < x or x < b. 

Proof. There exist (q, q') £ a and (r, r') £ b such that q' < r. Given a real number 
x, we can find (s, s') £ x such that s' — s < r — q' . If s' < r, then x < b; if s' ^ r, 
then q' < s and so a < x. □ 


Lemma 2.1.9. // x > y ^ z or x ^ y > z, then x > z and x ^ z. 

Proof. Assume, for example, that x > y ^ z. By Proposition 2.1.8, either x > z or 
z > y. Since the latter is ruled out by Lemma 2.1.4, we have x > z and therefore, 
by Lemma 2.1.5, x ^ z. □ 


Lemma 2.1.10. If (q, q') £ x, then q ^ x ^ q'. 

Proof. By Lemma 2.1.1, for each ( r, r ') £ x, since the rational intervals [r, r’) and 
[< 7 , q'] intersect, r' ^ q and q' ^ r. The desired conclusion now follows from the 
definition of the relation □ 
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Lemma 2.1.11. For each real number x there exist rational numbers q, q' such 
that q < x < q'. 

Proof. Let ( r,r ') be any element of x. By Lemma 2.1.10, r ^ x ^ r'. Choosing 
q,q 1 in Q with q < r ^ r' < q 1 , we see from the definition of the relation < that 
q < x < q'. □ 


The following two propositions show that the classical law of trichotomy does 
not hold constructively, even in the weak form discussed in Proposition 2.1.13. 

Proposition 2.1.12. The statement 

Vx G R. (x ^ 0 =>■ x > 0 V x = 0) 


implies LPO. 

Proof. Given an increasing binary sequence (a n ) n>1 , define the real number x as 
at (2.2). It is routine to check that x ^ 0; that if x = 0, then a n = 0 for all n; and 
that if x > 0, then there exists n such that a n = 1. □ 


Proposition 2.1.13. The statement 

Vx € R (x ) 0 V x ^ 0) 


implies LLPO. 

Proof. Let (a n ) n>1 be a binary sequence with at most one term equal to 1, and 
define a real number by 

x = { ^ : Vfc < n{a k = 0)| U | ^(-1)" (-1)" ^ : a n = 1 j . 

If x ^ 0, then it is impossible that a n = 1 for an odd n , so a n = 0 for all odd n. 
Likewise, if x ^ 0, then a n = 0 for all even n. □ 


Lemma 2.1.14. // x < y, then there exists s G Q such that x < s < y. 


Proof. Since x < y, there exist (q, q') G x and (r, r') G y such that q' < r. Then 
the rational number 


s = 


\ («' + r ) W + r ) 


has the desired property. 


□ 
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The maximum of two real numbers x and y is the set max {x, y} of all ratio- 
nal pairs of the form (max {q, r} , max {</, r 1 }) where (q,q') € x, ( r,r ') € y, and, 
for example, max {q, r} is the maximum in Q of the two rational numbers q , r. 
The minimum of x and y is the set min {x, y} of all rational pairs of the form 
(min {q, r} , min {</, r / }) where {q,q') € x, ( r,r ') £ y, and min{g , ,r / } is the mini- 
mum in Q of the two rational numbers q' , r' . The maximum, max {xi, . . . , x n } , and 
minimum, min {xi, . . . , x„} , of finitely many real numbers are defined analogously. 
We define the negative of a real number x to be 

x = { (q, q') e Q x Q : (-q'~ ?)ex}, 

where, for a rational number q , the expression - q denotes the negative defined in 
the usual way. The absolute value of the real number x is the set 

|x| = max {“x, x} . 

It is not hard to verify such properties as the following: 

• max{x,y}, min{x, y}, x, and |x| are real numbers. 

• max {x, y} < z if and only if x < z and y < z. 

• max {x, y} > z if and only if either x > z or y > z. 

• min {x, y} < z if and only if either x < z or y < z. 

• x < y if and only if _ y <” x. 

• max{x,y}= _ min{ _ x,”y}. 

• |x| < y if and only if ~y < x < y. 

In order to avoid tedious, detailed verifications of all the usual facts about real 
numbers, we shall often take such properties for granted. 

Lemma 2.1.15. For each real number x there exists a positive integer n such that 
|x| < n. 

Proof. Let (q, q') € x, and choose a positive integer n such that 

max {|g| , \q'\} < n. 

Using rational arithmetic, we see that — n < q ^ q' < n; whence, by the definition 
of the relation <, we have —n < x < n and therefore |x| < n. □ 


There are two common misconceptions about the constructive properties of the 
real line R: that it is countable and that it is not complete. The following lemma 
enables us to dispose of the first of these misconceptions. 
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Lemma 2.1.16. Let I = [0, 1], and for i = 1,2,3 let J, = §] ■ Then for each 

x£l there exists i € {1,3} such that x ft y for all y S Jj. 

Proof. This is immediate, since the relation < is transitive and, by Proposition 
2.1.8, either x > 1/3 or x < 2/3. □ 


We refer to the interval Ji in Lemma 2.1.16 as the leftmost third of I, and to 
J3 as the rightmost third. The uncountability of K is a simple consequence of the 
following version of Cantor’s theorem. 

Theorem 2.1.17. //(a„) n>1 is a sequence of real numbers, then there exists x £ 
[0, 1] such that x ft a„ for each n. 

Proof. We construct inductively a sequence ((q n , q' n )) n>1 in Q x Q such that for all 
n ^ 1, 


• 0 < ft n - q n = 3~ n , 

• [qn,q' n ] C [q n - i,q' n ^i] , and 

• & n ft y for each y G [q n , q' n ] . 

For convenience we take qo = 0 and q' 0 = 1. Suppose that we have already con- 
structed the elements (q%, q' k ) (0 < k ft n — 1) of Q x Q such that the applicable 
properties hold. By Lemma 2.1.16, either a„ ft y for all y in the leftmost third of 
the interval [q n -i, q' n -i] , or else a„ ft y for all y in the rightmost third. In either 
case we take q n and q' n to be respectively the left and right endpoints of the third of 
[(?„_!, q' n _i\ in question. It is clear that the pair ( q n ,ft n ) satisfies the desired prop- 
erties, and hence that our inductive construction is complete. It is equally clear 
that 

x = {( q ni q' n ) :«eN} 

is a real number. Finally, for each positive integer n we have x £ [q ni q' n J, by Lemma 
2 .1.10, and therefore x/a n . □ 


In this proof we used the principle of dependent choice for the first time and 
in the informal manner typical of the analyst. So it may be a good idea just once 
to show how this informal way of invoking dependent choice can be made more 
rigorous and explicit. Accordingly, let 

X = {(n,q,q r ) : n £N,q £Q,q' £Q,q < q'} . 

Let P be the set of all pairs ((n, q, q') , (m, r, r')) £ X x X such that m = n + 1, 
[r, r'] C [q,q'\, r' - r = | (ft - q), and a„ ft y for each y £ [r, r'] . Applying 
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the principle of dependent choice, construct a mapping / : N — > X such that 
/( 0) = (1, 0, 1) and + 1)) G P for each n. Writing 

f(n- 1) = (n, q n -i,q' n -i) 

for each positive integer n, we see that 

[Vn, q'n\ C [q n - l,q' n -l\ > ~ ^ (q'n- 1 “ 9n-l) , 

and a„ y for each y G [q n , q' n \ , as we required. 

We now seek a constructive analogue of the classical least-upper-bound princi- 
ple. Let S' be a set of real numbers. We say that a real number b is an upper bound 
of/for S if s ^ b for all s G S; and that b is the (perforce unique) least upper bound 
of S if it is an upper bound and for each x < b there exists s G S such that x < s. 
In the latter event we also call b the supremum of S and we denote it by sup S. 

The classical least-upper-bound principle implies the law of excluded middle. 
To see this, let P be a syntactically well-formed proposition, and consider 

S = {0} U{iGR:i = 1A P} , 

which contains 0 and is bounded above by 1. If sup S exists, then it is either greater 
than 0 or less than 1. In the first case, there exists x G S such that x > 0; whence 
x = 1 and P holds. In the other case we must have ~^P. 

We define a set S of real numbers to be upper order located if for all rational 
numbers a, b with a < b, either x ^ b for all x in S or else there exists xGS with 
a < x. We can now state the constructive least-upper-bound principle, which is also 
known as the Dedekind ( order) completeness of R. 

Theorem 2.1.18. Let S be an inhabited set of real numbers that is both bounded 
above and upper order located. Then the least upper bound of S exists. 

Proof. Let B be the set of upper bounds for S, and define 

£ = {(<7, q') G Q x Q : 3s G S 3b G B (q < s < b < q')j . 

Taken with Lemmas 2.1.10 and 2.1.9, the hypotheses ensure that f is inhabited. 
If (q,q r ) and (r,r') belong to £, then there exist Si,s 2 G S and b!,b 2 G B such 
that q ^ Si < bi < q' and r < s 2 < b 2 < r'. Then si < b 2 , so q < b 2 ^ r'\ 
similarly, r ^ q' . By Lemma 2.1.1, there exists a rational number in [q, q'] 0 [r, r '] . 
To complete the proof that £ is a real number, we show that for each rational e > 0 
there exists (q,q') G £ with q' — q < e. To this end, fix (a, a') in £. If a = a', 
then there is nothing to prove; so we may assume that a < a' . Construct rational 
numbers ao = a < ai < a 2 < • • • < a n = a 1 such that a, — a,_i < e/2 for 1 ^ i ^ n. 
Since S is upper order located, either a 2 G B or else ai < s for some element s 
of S. In the first case, (00,122) G £ and a 2 — oq < s. In the second, either <23 G B 
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and therefore (ai, 03 ) G £ and 03 — 01 < e; or else <22 < s for some element s of X. 
Carrying on in this way, since a n € B we can be sure of finding k < n — 1 such that 
(afe,afc. 1 - 2 ) G £ and 0^+2 — < e. Thus £ is indeed a real number. 

To show that £ is an upper bound for S, consider any (q, q') in £ and any s in 
S. There exists b G B such that q ^ b ^ q' . For any (r, r') in s we have r ^ s, by 
Lemma 2.1.10, and therefore r ^ b; whence r ^ q' . It follows that s ^ £. 

Finally, if x < £, then we can find (q. q') G £ and (r, r') G x such that q > r' . It 
follows from Lemma 2.1.10 that x ^ r' < q. By definition of £, there exists s G S 
with q ^ s; whence x < s, by Lemma 2.1.9. This completes the proof that £ is the 
least upper bound for S. □ 


Let S' be a set of real numbers. We say that a real number b is a lower bound 
of/for S if b ^ s for all s G S; and that b is the (perforce unique) greatest lower 
bound of S if it is a lower bound and for each x > b there exists s G S such that 
x > s. In the latter event we also call b the infimum of S and we denote it by inf S. 

Corollary 2.1.19. Let S be an inhabited set of real numbers that is bounded below 
and is lower order located in the following sense: for all rational numbers a, b 
with a < b, either a is a lower bound for S or else there exists s G S with s < b. 
Then the greatest lower bound of S exists. 

Proof. Apply Theorem 2.1.18 to the set 

T = {"s : s G S} , 

which is inhabited, bounded above, and upper order located, to construct its supre- 
mum b. Then _ b is the infimum of S. □ 


We now want to introduce the arithmetic operations on real numbers. Given 
real numbers x and y, we need to find the appropriate ways of combining the 
rational pairs (representing intervals) that constitute the real number x with those 
that constitute y, in order to create the rational pairs that represent xoy, where 
o stands for any of the operations +, — , x , -G. We begin with the easy definitions of 
+ and — , for the moment leaving aside the more complicated ones for x and -G. 

We define the sum x + y and difference x — y of the real numbers x, y to be, 
respectively, 

x + y = {(s, s') : 3 (q, q') Gx3(r,r) Gy(s = q+ rAs' = q l + r')} , 
x - y = {(s, s') : 3 (q, q') G x 3 (r, r') G y (s = q - r A s' = q' - r')} . 

Let us verify, for example, that x+y is a real number. Let (qi, q[) G x and (n, r[) G 
y. Certainly, qi+ri ^ q[ + r l 1 . Moreover, given a positive rational number e, we can 
arrange that q[—qi < e/2 and r[ — n < e/2, so ( q\ + r'f) — ( q\ + n) < e. It remains 



34 


2 Techniques of Elementary Analysis 


to show that if also (q 2 , q' 2 ) G x and (r 2, r^) G y, then the intervals [g* + Vi , g' + r'] 
(i = 1,2) in Q intersect. This is easy: there exist rational numbers £,77 such that 
qi ^ ^ q[ and r, ^ 77 ^ r\ (i = 1,2); whence 

Qi + n ^t + V^Qi + r’i (i = 1,2). 

Thus x + y is a real number. 

Note that if x < y, then x + z < y + z for all real z; and that x — y = x+ ( _ y) . 
In view of the latter, we adopt the normal convention of writing — x instead of _ x. 

Lemma 2.1.20. For each real number x there exists a positive integer N such that 
max{|g| , |g'|} < N whenever (g, g') G x and q' — q < 1. 

Proof. With n as in Lemma 2.1.15, set N = n + 1. If (g, q') G x and q' — q < 1, 
then, using Lemma 2.1.10, we obtain 

g<x^|x|<n<.ZV 


and therefore 


q' <q+l<n + l = N. 


Since (—q',—q) G — x and |— x| = |x| , we likewise have —qf < N and —q < N. 
Hence \q\ < N and \q'\ < N, so max{|</| , jg'j} < TV. □ 


We next define the product x x y, also written x • y or xy, to be the set of all 
rational pairs (s, s') such that there exist (q, q') G x and (r, r') G y with 

s = min {qr, qr ' , q'r. q'r 1 } , s' = max { qr , qr\ q'r, q'r 1 } . 

Certainly, s ^ s'. Using Lemma 2.1.20, compute a positive integer N such that if 
(q, q') G x and q' — q < 1, then max {|g| , |q'|} < TV, and such that if (r, r') G y and 
r' — r < 1, then max{|r| , | r' \ } < N. Given a rational e with 0 < e < 1, choose 
(q,q') G x and (r, r') G y such that q' — q < e/2 N and r' — r < e/2 N. Routine 
rational arithmetic calculations show that 

s' — s < (q 1 — q) max {|r| , |r'|} + (r' — r) max {|g| , \q'\} < e. 

For t = l,2 let (g», q[) G x, (n, r[) G y, and 

Si = min {q^i, g*r •, g'n, q[r •} , s' = max {q^, g'n, q[r '} . 

Pick ^ in [gi,^] n [92,^2] aR d 77 in D [r 2 ,r ' 2 ] ; it is easy to verify that £77 G 

[si, si] fl [S2, s^] . This completes the proof that xy is a real number. 

When dealing with division, we consider two real numbers x,y with y 0. In 
the case y > 0 we construct a rational pair (s, s') in the quotient x/y (also written 
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p) as follows. We pick (q, q') G x and (r, r') G y such that r > 0. If q ^ 0, we set 
s = q/r' , s' = q' /r; if q < 0, we set s = q/r,s' = q' /r' . In the case y < 0 we define 
x/y to be — ((— x)/y). We leave as an exercise the details that x/y is indeed a real 
number, and that x(l/x) = 1. 

As classically, we say that a sequence (x„) n>1 of real numbers converges to a 
real number Xoo, called the limit of the sequence, if 

Ve > 0 3N Vn ^ TV (jxoo — x„| < e) . 


We then write 


Xoo as n 


or 

Xoo = lim x„. 

n—> oo 

The uniqueness of the limit, and the basic algebraic properties of sequences of 
real numbers, will be assumed without proof, since the proofs are virtually the 
same as their classical counterparts. However, we should note that the monotone 
convergence theorem for sequences, 

Every increasing sequence in R that is bounded above (that is, whose terms 
form a set that is bounded above) converges, 

implies LPO. 

By a Cauchy sequence of real numbers we mean a sequence (x„) n>1 such that 
Ve > 037VVm,n ^ N (|x m — x„| < e) . 

Every convergent sequence of real numbers is a Cauchy sequence. We now counter 
the second major misconception about the constructive nature of R by establishing 
its so-called (Cauchy) completeness. 

Theorem 2.1.21. Every Cauchy sequence of real numbers converges to a real num- 
ber. 

Proof. Let (x„) ra>1 be a Cauchy sequence of real numbers, and, using countable 
choice, compute a function k Uk from N + to N + such that 

Vfc V?n, n ^ rife (|x m — x„| < 2~ fe ) . 

Again using countable choice, construct a sequence ((<7fc, <?fc))fe> 1 such that 

Vfc (( qk , q'k ) e X„ fc A q' k - q k < 2~ k ) . 

Define rational numbers 

r k = qk — 2 k , r' k = q' k + 2 k . 
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Then for all n ^ n k , 

r k < x„ fc - 2~ k < x„ < x„ fc + 2 _fc < r*.. (2.3) 

It follows that for all j ^ k, 

S e [rj,r'j\ D [r k ,r' k ] . 

Since r' k — r k < 2~ k+2 — » 0 as k — > oo, we conclude that 

Xoo = {{r k ,r' k ) : k ^ 1} 

is a real number. From (2.3) we have x n £ [r k ,r' k \ for all n ^ n k ■ It follows that 
Vfc Vn 3s n k (|x„ - x^l < r k - r k < 2 ~ k+ 2 ) . 

Noting Lemma 2.1.6, we see that x„ — > ■x OCl as n — > oo. □ 


We shall see in later chapters that completeness is often used in constructive 
mathematics to prove propositions that are immediate consequences of omniscience 
principles. 

For the remainder of this book we shall drop the use of boldface type to denote 
real numbers; it has served its purpose to signal a distinction between a rational 
number and a real number (a set of special pairs of rational numbers). We shall also 
assume, without further comment, basic properties of real numbers — for example, 
x 2 ^ 0 for all real x — that can easily be deduced from the foregoing results. 

The complex plane C consists of all complex numbers — ordered pairs (x, y) of 
real numbers — with addition and multiplication defined by 

(x, y ) + (x', y') = (x + x',y + y') , 

(x, y) x ( x' , y’) = ( xx' - yy', xy’ + x'y) . 

The equality and inequality on C are defined by 

(x, y) = (a/, y') <*=> x = x' A y = y' 

(x, y) ± (x' t y') x^x'Vy^y 1 . 

We embed K as a subset of C in the usual way by identifying the real number x with 
the complex number (x, 0) . The pair i = (0, 1) then has the special property that 
i 2 = —1; and every complex number 2 = ( x,y ) can be written in the form x + i y, 
with real part Re z = x and imaginary part Im z = y. The complex conjugate of 
z = x + iy is the number x — i y, denoted by z* . We shall assume basic definitions 
and properties of C, such as its completeness, as they are needed. The same goes 
for the Euclidean spaces 1" and C n , which are now defined in the standard ways. 
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2.2 Metric Spaces 

Since we are assuming some familiarity with the classical theory of metric spaces, 
in the following we shall emphasise differences between the classical and the con- 
structive theory, as well as those constructive properties, such as locatedness, that 
play no role in classical analysis. We normally denote the metric on a set X by p. 

The definitions of such notions as metric, (metric) subspace, ball, interior, and 
cluster point are exactly as in classical analysis; see, for example, [47]. The interior 
of a subset S' of a metric space X is denoted by S° and is a subset of S; if S = S°, 
then S is said to be open. The set S of cluster points of S is called the closure of 
S (in X), and contains S; if S = S, then S is said to be closed in X. If S = X, 
then S is dense in in X. This happens if and only if for each x € X and each £ > 0 
there exists s € S such that p (x, s ) < e. 

We denote the open and closed balls in X with centre a and radius r > 0 by 
B(a,r) and B(a,r) respectively. The inequality on a metric space is defined by 

x ^ y p{x,y) > 0. 

If Xi, . . . , X n are metric spaces, with metrics p 1 , . . . , p n respectively, then the 
Cartesian product X = X\ x • • • x X n is a metric space relative to the product 
metric 

P = Pi H 1" Pn- 

We then call (X, p) the product of the metric spaces X lf . . . , X n . 

As in the classical theory, both X and 0 are open, arbitrary unions of open 
sets are open, and finite intersections of open sets are open; in other words, the 
open sets form a topology on X. Likewise, both X and 0 are closed, and arbitrary 
intersections of closed sets are closed. But we cannot prove that the union of two 
closed sets is closed, even in the case X = K and the two closed sets are closed 
intervals: both the intervals [—1,0] and [0,1] are closed in R, and their union is 
dense in [—1,1]; but if that union is closed, then we have 

\/x € M (a: < 0 V x ^ 0) , 

a proposition equivalent to LLPO. 

Proposition 2.2.1. The logical complement of an open subset S of X is closed in 
X, and coincides with the complement of S. 

Proof. Given a cluster point a of -> S and any point x of S, choose, in turn, r > 0 
such that B{x,r) C S, and y € ~>S such that p(a,y ) < r/2. If p(a,x) < r/2, then 

P(x, y ) < p(a, x) + p{a, y ) < r, 

so y € S, which is absurd. Hence, by Proposition 2.1.8, p{a,x) > 0. Since x £ S is 
arbitrary, it follows that a € ~5 C ~^S. Thus ~>S is closed, and -> S C ~S C ->S\ 
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whence -<S = ~5. 


□ 


In contrast to the classical situation, we cannot prove constructively that the 
complement of a closed subset of R is open (Exercise 9) . 

The classical definitions of convergent sequence, Cauchy sequence , and complete 
metric space carry over unchanged into the constructive setting, as they did in the 
special case of the metric space R. A complete subset of a metric space X is closed 
in X\ and a closed subset of a complete space is complete. Moreover, the product 
of finitely many complete spaces is complete. 

We now introduce the first of two major themes of metric space theory. Given 
£ > 0, by an e- approximation to a subset S' of a metric space X we mean an 
inhabited subset T of S such that for each s £ S there exists t € T with p (s, t) < e. 
If for each £ > 0 there exists a finite ^approximation to S, then we say that S is 
totally bounded. 

The closure of a totally bounded subset of X is totally bounded. If a subset S 
of X contains a dense totally bounded set, then S itself is totally bounded. The 
product of finitely many totally bounded spaces is totally bounded. 

There are at least two reasons why total boundedness is so important in con- 
structive analysis. The first is that with its help we can compute suprema and 
infima in many important situations. The second is that, coupled with complete- 
ness, total boundedness gives the only one of three classically equivalent notions of 
compactness that can be used in constructive analysis. 

The next lemma enables us to replace “finite” by the weaker “finitely enumer- 
able” in the definition of total boundedness. 

Lemma 2.2.2. Let £ be a positive number, and X a metric space with a finitely 
enumerable e-approximation. Then X has a finite rj- approximation for each rj > e. 

Proof. Let {aq, . . . , x n } be an ^approximation to X , and let rj > e. We may assume 
that n > 1. Either p(xi, xf) > 0 whenever i j, or p(aq, xf) < r/ — e for some pair of 
distinct indices i, j. In the former case, {aq, . . . ,x n } is a finite ?;-approximation. In 
the latter case we can delete Xj from {aq, . . . , x n }, to obtain a finitely enumerable 
E'-approximation with e' = s + p(xi, xf) < 77. Applying this argument at most n — 1 
times, we arrive at a finite 77 -approximation to X. □ 


Corollary 2.2.3. A metric space X is totally bounded if and only if for each e > 0 
there exists a finitely enumerable e- approximation to X. 

Proof. This is an immediate consequence of Lemma 2.2.2. □ 


A metric space X is separable if it has a countable dense subset S; an enumer- 
ation (x n ) n>1 of S is then called a dense sequence in X. The real line is separable, 
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since, by Lemma 2.1.10 and the definition of “real number”, it has Q as a countable 
dense subset. 

Proposition 2.2.4. A totally bounded metric space is separable. 

Proof. If, for each positive integer n, F n is a finite 1 /n-approximation to the metric 
space X , then (J F n is a countable dense subset of X. □ 

71^1 


We now have what is among the most useful of all results about totally bounded 
sets. 

Proposition 2.2.5. If S is a totally bounded subset of R, then sup S' and inf S 
exist. 

Proof. We first consider the case where S = {xi, . . . ,x n } is finitely enumerable. 
Given real numbers a , b with a < b, we apply Proposition 2.1.8 repeatedly, to prove 
that either Xk < b for each k or else there exists j such that Xj > a. It follows from 
the constructive least-upper-bound principle that sup S exists. 

Now consider the general case. Again let a, b be real numbers with a < b, but 
this time write e = (b — a)/2 and construct a finite £-approximation {aq , . . . ,x n } 
to S. By the first part of the proof, 

a = sup {aq, . . .,x n } 

exists. By Proposition 2.1.8, either a > a or a < a + e. In the first case there 
exists j such that Xj > a. In the second, consider any x £ S. Choosing j such that 
\x — xj\ < e, we have 


x ^ Xj + \x — Xj | < a + £ < a + 2e = b. 

So in this case, b is an upper bound for S. It follows from Theorem 2.1.18 that sup S 
exists. Similar arguments show that inf S exists. □ 


A mapping / : X — > Y between metric spaces is 

• sequentially continuous at x € X if for each sequence (x n ) n>1 that converges in 
X to x, the sequence (/(a; n )) n >i converges in Y to /( x); 

• sequentially continuous if it is sequentially continuous at each point of X\ 

• continuous at x £ X if for each e > 0 there exists S > 0 such that 
p (f(x),f(x')) < £ for all x' £ X with p (x, x') < S; 

• continuous if it is continuous at each point of X. 
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Although sequential continuity is classically equivalent to continuity, the most we 
can say constructively is that the latter notion implies the former. 

For constructive practice it is frequently necessary to assume the following 
stronger continuity property. We say that / : X — > Y is uniformly continuous (on 
X) if for each £ > 0 there exists 5 > 0 such that for all x, x' G X, if p(x,x r ) < 5, 
then p f{x')) < e. Clearly, uniform continuity implies continuity. Although 

in CLASS and in INT every continuous mapping from the bounded closed interval 
[0, 1] into R is uniformly continuous, since this is not the case in RUSS we cannot 
expect to prove it in BISH; see Chapter 3 of [34]. 

Proposition 2.2.6. If X is a totally bounded space, and f a uniformly continuous 
mapping of X into a metric space, then f(X) is totally bounded. 

Proof. Given £ > 0, compute S > 0 such that if x,y € X and p(x,y) < 6, 
then p(f(x),f(y)) < e. There exists a finite ^-approximation {aq, . . . , x n } to X. 
For each x € X pick i such that p(x,x f) < 5. Then p(/(x), /(aq)) < e. Hence 
{/(aq), . . . , f(x n )} is a finitely enumerable ^approximation to f(X). □ 


Corollary 2.2.7. Let f be a uniformly continuous mapping of a totally bounded 
space into R. Then ran (/) is bounded, and the supremum 

sup f (x) = sup {y : y G ran(/)} 

and infimum 

inf f(x) = inf {y : y G ran(/)} 

of f exist. 

Proof. This is an immediate consequence of Propositions 2.2.6 and 2.2.5. □ 


Corollary 2.2.8. If X is a totally bounded metric space, then its diameter, 
diam(X) = sup {p(x, y) : x,y G X} , 

exists. 

Proof. Since the mapping p on the totally bounded product space X x X is uni- 
formly continuous, the desired conclusion follows from Corollary 2.2.7. □ 


We now introduce a second major theme in constructive metric space theory. 
An inhabited subset S' of a metric space X is located in X if for each x G X the 
distance 
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p ( x , S) = inf {p(x, s) : s £ S} 

exists. 

We cannot prove the proposition “every inhabited subset of R is located” con- 
structively, since it implies LEM. To see this, let P be any syntactically correct 
proposition, and 

S = {0} U {x G R : (a: = 1) A P} . 

If S is located, then either p( 1, S') > 0 or p( 1, S) < 1. In the first case we have ~>P. 
In the second, choosing s £ S such that p(l,s) < 1, we see that s ^ {0} , so s = 1 
and P holds. 

It is hard to overemphasise the importance of locatedness in constructive analy- 
sis. To illustrate, we anticipate later sections and chapters by pointing out that 
the norm of a bounded linear functional on a normed space exists if and only if 
the kernel of the functional is located, and that locatedness plays a vital role in 
the separation and Hahn-Banach extension theorems. It is only when we work con- 
structively that the significance of locatedness (a property that holds automatically 
under classical logic) for the proofs of many existence theorems is revealed. 

A subset 5 of a metric space is located if and only if its closure is located; in 
fact, for each x £ X we have 


p(x,S) = p(x, S) 

if either side of this equation exists. Note that even if a subset S' of a metric space 
X is located, we may not be able to prove that its metric complement 

—S = {x £ X : p(x, S) > 0} 


is located (Exercise 11). 

Proposition 2.2.9. A totally bounded subset of a metric space is located. 

Proof. Let S be totally bounded in the metric space X. For each x £ X the map- 
ping s p{x, s) is uniformly continuous on S. It follows from Corollary 2.2.7 that 
inf{p(a;,s) : s £ S} exists. □ 


The next two proofs use a technique that will reappear from time to time. We 
have a finitely enumerable set {aq, ... ,x n } of a metric space X, a located subset 
S of A, and positive numbers a, b with a < b. We write {1, . . . , n} as a union of 
subsets P, Q such that 


i £ P => p(aq, S ) < 6, 
i £ Q =4> p(xi, S ) > a. 


By concentrating on one of those sets, we are able to reach our desired conclusion. 
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Proposition 2.2.10. A located subset of a totally bounded metric space is totally 
bounded. 

Proof. Let S' be a located subset of a totally bounded metric space X, and let £ > 0. 
There exists a finite s/3-approximation {cci , . . . , x n } to X. Using Proposition 2.1.8, 
write {1, . . . , n} as a union of two sets P and Q, where p(xi, S) < 2e/3 if i £ P, and 
p{xi, S ) > e/3 Hi £ Q. For each i £ P there exists s* € S such that p(xi, s,;) < 2e/3. 
Given s € S, choose i such that p(s,Xi) < e/3. Then p(xi,S) < e/3, so i ^ Q\ 
whence i £ P and therefore 

£ 2s 

p(s, Si) < p(s, Xi) + p(Xi, Si) < - + y = £. 

Thus {sj : i £ P} is a finitely enumerable ^approximation to S. □ 


The next theorem will provide us with a rich supply of totally bounded, and 
hence located, subsets of a totally bounded space. 

Proposition 2.2.11. Let X be a totally bounded space, Xq a point of X, and r a 
positive number. Then there exists a closed, totally bounded subset K of X such 
that B(xo,r) C K c B(xo,8r). 

Proof. With Pi = {cco}, we construct inductively a sequence (F n ) n ^i of finitely 
enumerable subsets of X such that 

(a) p(x,F n ) < 2~ n+1 r for each x in B(xo,r), 

(b) p(x,F n ) < 2 ~ n+3 r for each x in F n+ 

To this end, assume that Fi, ... , F n have been constructed with the appropriate 
properties, and let {:c\ , . . . , Xn} be a 2 _n r-approximation to X. Write {1, . . . , N} 
as a union of subsets A and B such that 

p(xi,F n ) < 2~ n+3 r if i £ A, 
p{xi, F n ) > 2~ n+2 r if i£B. 

Then 

F n+ i = {xi : i £ A} 

clearly satisfies the appropriate instance of (b). Let x be any point of B(xo,r). By 
the induction hypothesis, there exists y in F n with p(x,y) < 2 ~ n+1 r. Choosing i in 
{1, . . . , N} such that p(x, xf) < 2 ~ n r, we have 

p(xi,F n ) < p(xi,y ) < p(x,x.i) + p(x,y) < 2 ~ n+2 r. 
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Thus i ^ B, so i G A and therefore Xi G F n+ Since p{x,xf) < 2 _ ( n+1 ) +1 r, the set 
F n+ 1 therefore satisfies the appropriate instance of (a). This completes the inductive 
construction. 

Let K be the closure of (J F n in X. We see from (a) that B(xo,r) C K. On 

n^l 

the other hand, given x G K and a positive integer n, we can find m and y G F m 
such that p(x,y) < 2 ~ n+4 r. If m ^ n, then by (b), there exist points y m = y , 
Urn- 1 e F m _ 1 , y n G F n such that p(y i+ i,y t ) < 2~ t+3 r for n < i < m - 1. Thus 

m— 1 oo 

p{y, Fn) < P {y, Vn) < ^2 p(Vi+i , Vi) < '52 2_I+3r = 2 _n+4 r (2.4) 

i=n i=n 

and therefore 

p(x, F n ) < p(x, y) + p(y, F n ) < 2 _n+4 r + 2 _n+4 r = 2 _n+5 r. 

n 

It follows that (J Fi is a finitely enumerable 2 _n+5 r-approximation to K. Since n 

i=l 

is arbitrary, we conclude that K is totally bounded. 

Finally, taking n = 1 in (2.4), we see that p(y,x o) < 8r for each y in (J F n ; 

n^l 

whence K C B(xq, 8r). □ 


Corollary 2.2.12. If X is a totally bounded space, then for each e > 0 there exist 
totally bounded sets K \, . . . , K n , each of diameter less than or equal to e, such that 

n 

x= U K t . 

i= 1 

Proof. Given £ > 0, construct an s/16-approximation {x \, . . . , x n } to X. By Propo- 
sition 2.2.11, for each i in {1, . . . ,n} there exists a closed, totally bounded set I( t 

n 

such that £?(#*, e/16) C K t C B(xi,s/ 2) . Clearly, X = (J 7\.j. Also, p(x,y) < £ 

i — 1 

for all x, y in A/, so diam (A/) ^ e. □ 


A property P, applicable to certain elements of a set S, is said to hold for all 
but countably many x in S if there exists a sequence (x„) n>1 in S such that P{x) 
holds whenever x G S and x ^ x n for each n. The sequence (x n ) n>1 is then called 
the excluded sequence, and the elements x such that x ^ x n for each n are said to 
be admissible, for the property P. 

Theorem 2.2.13. Let f be a uniformly continuous mapping of a totally bounded 
metric space X into R. Then for all but countably many r £ R the set 

X (fx) = {x G X : f(x) < r} 
is either totally bounded or empty. 
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Proof. By Corollary 2.2.12, for each positive integer k there exist a positive integer 
n k and totally bounded sets X k p (1 < j < nk), each of diameter less than 1 /k, 
whose union is X. Define the excluded sequence (f„) n>1 to be an enumeration of 
the real numbers 


c k j = inf {f(x) : x G X k j} (, k > 1; 1 < j < n k ). 


(Note that the numbers c k ,j exist in view of Corollary 2.2.7.) Let r ^ r n for each n. 
If Ckj > r for 1 ^ j < n k , then X ( /, r ) = 0. So we may assume that there exists 
v ^ n k such that c k j < r for 1 ^ j ^ v, and c k j > r for v < j ^ n k - For each 
j ^ v choose x k ,j G X k .j such that f(x k ,j) < r. For all such k,j and all x G X k j 
we have 


p(x,Xk,j ) < diam (X k j) < 


(2.5) 


Consider any x G X(f,r) and any positive integer k. Choose j with 1 ^ j ^ n k 
and x G Xkj. Then c k j ^ fix) ^ r, so c k j < r (since c k j ^ r) and therefore 
j ^ v. By (2.5), p(x,x k ,j) < 1/k. Hence {x k j : 1 ^ j ^ v} is a finitely enumerable 
l//c-approximation to X(f,r). Since k is arbitrary, it follows that X(f,r) is totally 
bounded. □ 


A complete, totally bounded metric space X is said to be compact. The bounded 
closed intervals [a, b] in R, and the closed balls in C, are compact; the product of 
finitely many compact spaces is compact. A compact subset of a metric space is 
both closed and (by Proposition 2.2.9) located; and a closed, located subset of a 
compact space is compact (see Proposition 2.2.10). 

Corollary 2.2.14. Under the hypotheses of Theorem 2.2.13, if X is compact, then 
X(f,r) is either compact or empty for all but countably many rGl. 

Proof. In view of Theorem 2.2.13, it suffices to note that, by the uniform continuity 
of /, the set X(f,r) is closed in X , and therefore complete, for each admissible r. 

□ 


We say that two subsets S,T of a metric space X are apart from each other, and 
we write S tx T, if there exists r > 0 such that p(s, t) ^ r for all s G S and t G T. A 
mapping / : X — > Y between metric spaces is said to be strongly continuous if for 
all subsets S,T of X with f(S) ix f(T) we have S tx T. It is simple to show that 
uniform continuity implies strong continuity. The following partial converse will be 
used shortly. 

Proposition 2.2.15. Let f be a strongly continuous mapping of a metric space X 
onto a totally bounded metric space Y. Then f is uniformly continuous. 
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Proof. Given e > 0, construct an e/8-approximation {f(x i), . . . , f(x n )} to Y, and 
define 

Y l = B{f(x i ),l) 1 X^f-^Yf). 

Write {{i,j) : 1 < i, j ^ n} as the union of subsets P and Q such that 
(hj) e P=> p(f(Xi),f(Xj)) < 

(i,j) G Q => ptfix^Jixj)) > |. 

If (i, j) G P, then, by the triangle inequality, p (f(x),f(x')) < s for all x, x' G X. So 
we may assume that there exists (i,j) G Q. For such (i,j) and any element (y,y') 
of Yi x Yj, since p(f(xi),y ) < s/8 and p (y 1 , f(xj)) < e/8, we have 

P(y,y') > P(f(xi),f(xj)) - p(f{xi),y) - p(y',f(xj)) > |. 

Hence 1) ix Yj. The strong continuity of / now yields XitxiXj-, whence there exists 
rij > 0 such that p (x, x') ^ r. h3 for all x G X t and x' G Xj. Let 

6 = min {ri,j : (i,j) G Q} > 0, 

and consider points x, x’ of X with p (x, x') < 6. Choose i. j such that f(x) G Y t and 
f(y) G Yj. If (i, j) G Q, then p( x,x') ^ r'ij ^ S, a contradiction. Hence (i,j) G P; 
so p{f(xi),f(xj )) < 3e/4 and therefore p(f(x),f(x')) < s. □ 


A mapping f : X — > Y between metric spaces is said to be strongly injective , 
or a strong injection , if it is one-one and the inverse mapping is strongly continuous; 
in other words, if for all subsets S,T of A with S cc T we have f(S) ix f{T). Every 
strongly injective mapping is injective. 

The following is a constructive substitute for the classical theorem that a contin- 
uous one-one mapping of a compact metric space into a metric space has compact 
range and a continuous inverse (a theorem that is false in RUSS). 

Proposition 2.2.16. Let f be a uniformly continuous strong injection of a com- 
pact metric space X into a metric space Y. Then f~ 1 is uniformly continuous and 
strongly injective on /(A), and /(A) is compact. 

Proof. The mapping / -1 : /(A) — > A is strongly continuous, and A is totally 
bounded. Hence, by Proposition 2.2.15, / -1 is uniformly continuous on /(A). Since 
/ is uniformly continuous and hence strongly continuous, / -1 is also strongly injec- 
tive. By Proposition 2.2.6, /(A) is totally bounded; so it remains to prove that it 
is complete. Accordingly, let (f(x n )) n>1 be a Cauchy sequence in /(A). Using the 
uniform continuity of / -1 , we see that (x n ) n>1 is a Cauchy sequence in A. Since 
A is complete, (x n ) n>1 converges to a limit x in A. Finally, since / is continuous, 
(/ ( x n )) n ^ i converges to f(x) in /(A). □ 
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Let JC(X) denote the set of all compact subsets of a compact metric space X. 
For all A,Bg K.{X), since the map x p (x, B) is defined (B being located) and 
is uniformly continuous on A, the number 

m,A,B = inf {p (x, B) : x G A} 

exists, by Corollary 2.2.7. Likewise, ms , a exists. It is left as an exercise to show 
that the mapping 

(A, B) max {itia.b, tub, a} 
from IC(X) x JC(X) into the nonnegative real line 

R 0+ = {x£l:i)0} 

is a metric — we call it the Hausdorff metric — with respect to which the space IC(X) 
is compact. 

An inhabited metric space X is said to be 

► locally totally bounded if each bounded subset of X is contained in a totally 
bounded subset; 

► locally compact if it is both locally totally bounded and complete. 

Every compact space is locally compact. The spaces R and C, and the product 
spaces R" and C n , are locally compact. A metric space X is locally compact if and 
only if every bounded subset of X is contained in a compact set. 

The following lemma prepares the way for our next theorem, which deals with 
certain fundamental properties of a locally totally bounded space. 

Lemma 2.2.17. Let Y be a located subset of a metric space X , and T a totally 
bounded subset of X that intersects Y. Then there exists a totally bounded set S 
such that T fl Y C S C Y. 

Proof. Using Theorem 2.2.13, construct a sequence (a n ) n ^ i such that for each 
positive integer n, 0 < a n +\ < a n < 1/n and 

T n = {x G T : p(x, Y) < a n } 

is totally bounded. Let F n be a finite 1/n-approximation to T n , and construct a 
mapping <f n : F n — > Y such that p(x, f> n {x)) < 1/n for each x € F n : then set 

S n = { <t>ni x ) ; x e F n} ■ 

Let S be the closure of (J S n in Y. We show that Sn is a 3/iV-approximation to 

n^l 

(J S’*,. Let k ^ N and y € S/. Then y = <f> k {x) G Y for some x € F k . Since 

k^N 

F k C Tfc C Tn, there exists z G Fn such that p(x, z) < 1/N; whence 
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1113 

p(y, + p{x, z) + p(z, <t> N {z)) < J. + ff + ^ ff- 

N 

We now see that (J Sk is a 3 /N - approximation to (J S n . So (J 5 n , and therefore 

/c=l n^l n^l 

5, is totally bounded. 

If x £ T n Y, then for each n we have x £ T n ; so there exists z € F n such that 
p(x, z) < 1/n and therefore p(x, </> n (z)) < 2 /n. Thus p(x, S) < 2/n for each positive 
integer n. Since S is closed in Y, it follows that x belongs to S. □ 


Proposition 2.2.18. Let Y be an inhabited subset of a metric space X. 

(a) If Y is locally totally bounded, then it is located. 

(b) If X is locally totally bounded and Y is located, then Y is locally totally bounded. 

Proof. Assume first that Y is locally totally bounded. Let yo £ Y and x £ X. The 
set 

B = {y £ Y : p(y, y 0 ) < 2 p(x, y 0 ) + 1} , 

being bounded in Y, is contained in a totally bounded subset K of Y. By Theorem 
2.2.13, there exists r > 2 p(x,yo) + 1 such that the set 

T = {y £ K : p(y, y 0 ) < r} 

is totally bounded and hence located. Then p(x,T ) ^ p(x,yo) < p(x,yf) + 1, so for 
each y in Y, either p(x,y) > p(x,T) or p(x,y) < p(x,yo) + 1. In the latter case, 

P(y, Vo) < p(x, y) + p(x, y 0 ) < 2 p(x, y 0 ) + 1 < r, 

so y £ T and therefore p(x,y) p(x,T). It follows that p(x,Y) exists and equals 
p(x,T). Thus Y is located. 

Now assume that X is locally totally bounded and that Y is located in X. 
Let B be a bounded subset of Y. Then there exists a totally bounded subset T 
of X such that B C T. By Lemma 2.2.17, there exists a totally bounded subset S 
of Y such that TC\Y C S and therefore B C S. Thus Y is locally totally bounded. □ 


2.3 Normed Linear Spaces 

Let IK stand for either R or C, and let X be a linear space over K. An inequality 
relation ^ on X is said to be compatible with the linear structure on X if for all 
x, y £ X and all t £ K, 
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X ^ y x-y^ 0, 

I + J//0 => x 0 V y ^ 0, and 

tx ^ 0 => t 0 A x 0. 

It readily follows from the first of these properties that 

x ^ y =4- Vz £ X (x + z 7^ y + z) . 

The requirement of compatibility between the inequality and the linear structure 
is a natural one and is automatically fulfilled by the denial inequality under classical 
logic. Constructively, it is certainly true that the denial inequality fulfils the first of 
the three requirements for compatibility; but unless we accept Markov’s principle 
we cannot expect to prove that it satisfies the second or the third. In the case 
X = K the standard inequality is compatible with the linear structure. 

From now on, unless we state otherwise, when we refer to a linear space we 
mean one that is equipped with a compatible inequality. Let X be such a space. 
By a seminorm on X we mean a mapping || || : x ||x|| of X into R 0+ such that 
for all x, y in X and all t in K, 

• ||x|| > 0 => 

• \\tx\\ = |t| ||x|| , and 

• lk + 2/IK Ml + ||y|| • 

We call the pair (X, || ||) — or, when no confusion is likely, just X itself — a semi- 
normed (linear) space over K. We say that the seminormed space X is real or 
complex, depending on whether K is R. or C. If x £ X and ||x|| > 0, then x is called 
a nonzero vector ; if ||x|| = 1, then x is called a unit vector. We call the seminormed 
space X nontrivial if it contains a nonzero vector. 

If the inequality on the seminormed space X satisfies 

i 7^ 0 || oj || > 0, (2.6) 

then we call || || a norm on X, ||x|| the norm of the vector x, and ( X , || ||) — or just 
X — a normed (linear) space over K. Note that every seminorm || || on a linear space 
X induces an inequality relation — namely, the one defined by (2.6) — with respect 
to which || || becomes a norm. 

Let X be a normed space. Then the mapping (x,y) ||x — y\\ of X x X into 

R is a metric on X, and is said to be associated with the norm on X. When 
we consider X as a metric space, it is understood that we are referring to the 
metric, usually denoted by p , associated with the given norm on X. Note that the 
inequality corresponding to the metric associated with the norm on X is just the 
original inequality on X. 

The unit ball of X is the closed ball with centre 0 and radius 1, 
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B x = B x { 0, 1) = B{ 0, 1) = {x £ X : ||x|| < 1} , 

relative to that metric. This ball, like any open or closed ball in a normed space, is 
located. 

It is a simple consequence of the triangle inequality that 

I INI - IHII <11* -i/ll 

for all vectors x, y in a normed space X. It follows from this that if a sequence 
(xn)n^i converges to a limit x in X, then the sequence (||aJn||) n >i converges to ||x|| 
in R. 

Perhaps the simplest examples of a norm are the following ones on K": 

(xi, . . ,,x n ) max {|xi| , . . . , \x n \} , 

(xi, \j\xx\ 2 H 1- \x n \ 2 , 

(*i, • • • ,x n ) |xi| H h |x„| . 

The second of these is called the Euclidean norm on which, when equipped 
with that norm, is called Euclidean n-space or simply Euclidean space. 

If S' is a compact metric space, then for each uniformly continuous map / : 
S — > K the sup norm 

ll/ll = sup{|/(x)| : x £ S} 

is well defined, by Corollary 2.2.7. It is easy to see that the mapping / ||/|| is 

a norm on the space C(S,K) of uniformly continuous functions from S to K, taken 
with pointwise operations of addition and multiplication-by-scalars. We usually 
denote C(S, C) by C(S). Convergence and Cauchyness with respect to the sup norm 
on C(S, K), where S is a compact metric space, are called uniform convergence and 
uniform Cauchyness respectively. The space C(S, K) is an example of a Banach 
space: that is, a complete normed space. The standard proof of this in classical 
analysis is constructive, and is left as an exercise. The Euclidean space K" is also 
a Banach space. 

Let Xi, X 2 be normed spaces over K, and recall that the standard operations of 
addition and multiplication-by-scalars on the product vector space X = Xi x X 2 
are given by 


{xi,x 2 ) + (x'ljXg) = (aq + xi, x 2 + x' 2 ) , 
t(x i,x 2 ) = (txi,tx 2 ) , 

where Xi £ Xi, x’ i £ X t . and t £ K. It is easy to verify that the mapping 

(xi,x 2 ) max{||xi|| , ||x 2 ||} 

is a norm on X , and that the metric associated with this norm is the product metric 
on X (considered as the product of the metric spaces X\ and X 2 ). Taken with this 
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norm, which we call the product norm, X is known as the product of the normed 
spaces X\ and X 2 . The product norm and the product space for a finite number of 
normed spaces are defined analogously. 

The proof of the following is left as an exercise. 

Proposition 2.3.1. Let X be a normed space over K. Then 

(a) the mapping (x,y) x + y is uniformly continuous on X x X; 

(b) for each t € K the mapping x tx is uniformly continuous on X; 

(c) for each x £ X the mapping t tx is uniformly continuous on IK; 

(d) the mapping (t, x) tx is continuous on IK x X. 

If X is a normed space and S is a linear subset of X, then the restriction to 
S of the norm on X is a norm on S; taken with this norm, S is called a normed 
linear subspace, or simply a subspace, of X. It is left as an exercise to show, from 
Proposition 2.3.1, that the closure of a subspace in X is also a subspace of X. 

Recall that a mapping u between vector spaces X, Y is linear if 

u(x + y) = u(x) + u(y) and u(tx) = tu(x) 

whenever x, y £ X and t £ 1. For example, the mapping x Ax on IK”, where A 
is an n-by-n matrix over IK, is linear. 

If X = y, then we refer to a linear map u : X — » Y as an operator on X. On 
the other hand, if Y = IK, then u is called a linear functional on X. 

Continuous linear mappings between normed spaces are the backbone of func- 
tional analysis. In order to characterise these maps, we need a simple lemma, whose 
proof is left to the reader. 

Lemma 2.3.2. For each element x of a normed space X, 

INI = inf {i^i : t G K, t ± 0, ||tx|| < lj . 

Proposition 2.3.3. The following are equivalent conditions on a linear mapping 
of a normed space X into a normed space Y : 

(a) u is continuous at 0. 

(b) u is continuous on X. 

(c) u is uniformly continuous on X. 

(d) u(Bx) is a bounded subset ofY. 
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(e) u maps bounded subsets of X to bounded subsets ofY. 

(f) There exists a positive number c, called a bound foru, such that ||u(a;)|| ^ c ||x| 
for all x £ X. 


Proof. Suppose that u is continuous at 0. Then there exists r > 0 such that 


IK*) II = ||u(x) -u(0)|| < 1 

whenever ||x|| ^ r. For each nonzero t € IK with ||tx|| ^ 1 we have ||rtx|| ^ r and 
therefore 

IK*)II = V\f\^ rtx ^ ^ V\f\- 

It follows from Lemma 2.3.2 that ||w(x)|| ^ - ||x|| for all x £ X. Hence (a) implies 

fl- 
it is clear that (f) => (e) =t> (d). Next, suppose that there exists c > 0 such 
that ||it(x)|| ^ c whenever ||x|| ^ 1. Given x in X and e > 0, we have either x ^ 0 
or ||x|| < e. In the first case, 


IK*) II 


11*11 



< C 11*11 • 


In the second, ||u ( £ 1 x) || < c, so 

IK*) II < ce < c(||x|| +e) . 

Thus ||u(x)|| < c(||x|| + e) in each case. Letting e — > 0, we obtain ||u(x)|| < c ||x||. 
Thus (f) holds. We now have 

IK* -2/) ll< c \\ x ~y\\ C x,y£X ), 

from which it follows that u is uniformly continuous on X. Thus (d) =>• (f) =$■ 
(c). 

Finally, it is immediate that (c) => (b) =>■ (a). □ 


In view of property (e) of Proposition 2.3.3, we refer to a continuous linear 
mapping between normed spaces X,Y as a bounded linear mapping. In the cases Y = 
X and Y = K, we use the terms bounded operator and bounded linear functional , 
respectively. 

Bounded linear functionals are associated with linear subsets that, in the case 
of a general normed space, correspond to planes in three-dimensional geometry. 

A linear subset H of a normed space X is called a hyperplane if there exist an 
associated vector Xq £ X and a positive number c such that 
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> ||x — Xo|| ^ c for each x € H, and 

> each x G X is represented (perforce uniquely) in the form x = tx o + y with 
t gK and y G H. 

Recall that the kernel of a linear mapping u between linear spaces X, Y is the 
set 

ker(w) = w' 1 (0) . 

We say that u is nonzero if there exists x G X such that u(x) ^ 0. If u is a nonzero 
bounded linear functional, then ker(it) is a hyperplane. To see this, choose x$ G X 
such that u(xq) = 1, and then (by Proposition 2.3.3) c > 0 such that c |u(x)| < ||x|| 
for each x G X. If x G ker(u), then 

||a: — x 0 || ^ c | u(x — xo)| = c | u(x) — u(xo)| = c. 

On the other hand, for each x G X we have 

x = u(x)x o + (x — u(x)x o) , 

where u(x) G IK and x — u(x)x o G ker(u). 

Proposition 2.3.4. Let X be a normed space, and H a hyperplane in X with 
associated vector Xq. Then there exists a unique bounded linear functional u on X 
such that ker(u) = H and u(x o) = 1. 

Proof. Compute c > 0 such that ||x — Xo| ^ c for each x G H. For each x G X 
there exist unique u(x) G IK and (j){x) € H such that x = u(x)x o + 4>(x). By the 
uniqueness, u{x o) = 1 and H = ker(u). Moreover, for each e > 0 we have either 

|«(aO|<-M+ e (2.7) 

c 

or else u(x) ^ 0. In the latter case we have 

1 

u(x) 

from which we see that (2.7) holds. Since e > 0 is arbitrary, we conclude that 1/c 
is a bound for u. □ 


If u : X — > Y is a bounded linear mapping between normed spaces and 

INI = sup{||u(x)|| : x G X, ||x|| < 1} (2.8) 

exists, we call this number the norm of u and we say that u is normed or normable. 
In that case, 


|x|| = |u(x)| Xq 
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IK^OII < INI INI (x&X), 

and if X is nontrivial, then 

Ml = sup{||u(x)|| : x £ X, ||x|| = 1} . 

Although there is no general criterion for the existence of the norm of a bounded 
linear mapping between normed spaces, such a criterion — and a very useful one at 
that — exists for bounded linear functionals. Our proof depends on a preliminary 
lemma. 

Lemma 2.3.5. Let u be a bounded linear map of a normed space X into a normed 
space Y. Then ker(u) is located if and only if 

s x = inf {t : t > 0, u( x) £ tu(Bx)} (2-9) 

exists for each x £ X. Moreover, if u(x) ^ 0, then s x > 0. 

Proof. For each x £ X we have 

{t > 0 : 3y £ ker(u)(||x - y|| < t)} 

= {t > 0 : 3z £ X (|| 2 || < 1 and u ( x — tz) = 0)} 

= {t > 0 : u(x) £ tu(Bx)}- 

The first part of the lemma now follows, since p (x, ker(u)), if it exists, equals the 
infimum of the first set, whereas s x , if it exists, equals the infimum of the third. 

Now let u(x) ^ 0, and choose r > 0 such that if y £ X and ||y|| < r, then 
|«(y)| < |u(x)| . Supposing that s x < r, we can find a positive t < r and an element 
z of B x such that u(x) = tu(z ) = u(tz ); but ||tz|| < r, so \u(tz)\ < |u(x)| , a con- 
tradiction. Hence s x ^ r. □ 


Proposition 2.3.6. A nonzero bounded linear functional on a normed space is 
normed if and only if its kernel is located. 

Proof. Let u be a nonzero bounded linear functional on the normed space X. Sup- 
posing first that u is normed — in which case ||w|| > 0 — consider any a £ X. For 
each y £ ker(w) we have 


||a 


2/11 > 


\ u (g-y)\ 

Ml 


H«)l 

Mil ' 


On the other hand, if 0 < e < ||w|| and we choose a unit vector x £ X such that 
u(x) > ||u|| — e, then 

x € ker(u) 


z = a — 
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and 


_ ,, _ H«)l , Kg)| 

u(x) \\u\\~s‘ 


Hence p(a, ker(u)) exists and equals |u(a)| / ||it|| . Since a £ X is arbitrary, ker(w) 
is located. 

Conversely, if ker(it) is located, then by Lemma 2.3.5, 
s = inf {t > 0 : 1 € tu(Bx)} 


exists and is positive. We show that ||it|| = 1/s. For each x £ Bx we have either 
u(x) < 1/s or u( x) ^ 0; in the latter case, 


K/OI 

u(x) 


< 1 


and 


1 = 


MjO| , 

|u(a:)| y u(x) 


so l/\u(x)\ ^ s and therefore |u(a;)| < 1/s. Thus 1/s is a bound for u. On the 
other hand, given e with 0 < e < 1/s, choose t < s/ (1 — es) and x G Bx such that 
1 = tu(x). Then u(x) = 1/t > 1/s — e. It now follows that ||«|| exists and equals 
1/s. □ 


We cannot expect the norm of a normed linear functional u to be attained at 
some vector in the unit ball of the domain space. For a certain important class of 
spaces, though, attainment of the norm does occur. 

A normed space X is said to be uniformly convex if for each e > 0 there exists 
S with 0 < S < 1 such that if x, y are unit vectors in X with ||| (a: + y) || >1 — 5, 
then 1 1 £ — y\\ < e. Every linear subspace of a uniformly convex normed space is 
itself uniformly convex. 

Proposition 2.3.7. If u is a nonzero normed linear functional on a uniformly 
convex Banach space X , then there exists a unique unit vector x € X such that 
u(x) = || U || . 

Proof. Construct a sequence (x n ) n>1 of unit vectors in X such that u (x n ) — > ||it|| 
as n — > oo. Given e > 0, choose 6 > 0 as in the definition of “uniformly convex”. 
Since 1 — | < 1, there exists a positive integer N such that (l — I) IMI < u(x n ) 
for all n ^ N. For to, n N we then have 



2.3 Normed Linear Spaces 


55 


3s u(x m ) - - I u(x m - X„)| 

> U { X m) - ^ (||u|| - u(x m )) - i (||u|| - u(x n )) 

> (i-0 IMI- jIHI-f INI 

= (1-S)\\u\\. 


Hence ||| (x m + x„)|| > 1 — <5 and therefore \\x m — x n \\ < e. Since e > 0 is arbitrary, 
we conclude that (x n ) n>1 is a Cauchy sequence. It therefore converges to a limit x 
in the complete space X. By the continuity of the norm and of u , we have ||x|| = 1 
and u(x) = ||w|| . 

Now let y be any unit vector such that u(y) = ||«|| = u(x). For each positive 
integer n define Z 2 n -i = x and Z 2 n = V- Then {z n ) n>1 is a sequence of unit vectors 
such that u(z n ) — > ||u|| ; so, by the first part of the proof, (. z n ) n>1 converges to a 
unit vector, which clearly must equal both x and y. Hence x = y. □ 


We next present an important construction of new normed spaces from old. 
Let Y be a located subspace of a normed space X, and define new equality and 
inequality relations on X by 

X =X/Y x' p (x — x' , Y) = 0, 

X ^ X/Y x' ^==> p (x — x', Y) > 0. 

Taken with this equality relation, X becomes the quotient space of X by Y, and is 
usually redesignated X/Y. The identity (linear) mapping x x from the original 
normed space X into X/Y is then called the canonical injection i x/y of X onto 
X/Y. If Y is also closed in X , then 

\\ x \\x/y = P( X > Y ) 

defines a norm — the quotient norm — on X/Y. Since ||x|| x ^ F ^ ||x — 0|| = ||x|| , the 
canonical injection i x/y is a bounded linear mapping. 

Note that the locatedness of Y is essential for the definition of the equality, 
inequality, and norm on X/Y. 

Proposition 2.3.8. If Y is a closed, located subspace of a Banach space X, then 
the quotient space X/Y is a Banach space. 

Proof. Given a Cauchy sequence (x n ) n ^i in X/Y, choose a strictly increasing se- 
quence (nk)k ^ l of positive integers such that 
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ll^rifc +1 X nk || x /y <2 ' (k > 1). 

Setting yi = 0, we construct inductively a sequence (yk) k >i in Y such that for each 

k, 


1 1 (Xnk+i Vk+l) ( x nic Vk) || <2 . (2.10) 

Indeed, having constructed elements yi, ... , y k of Y with the applicable properties, 
we have 


inf { ||x rajfe+1 - (. x nk - y k ) - y\\ : y € V } = ||x nfc+1 - x nk \\ x/Y <2 k , 

so there exists y k +i G Y such that (2.10) holds. We now see from (2.10) that 
( x nk — Vk)h > i is a Cauchy sequence in the Banach space X\ whence it converges to 
a limit z in X. We then have 


\\Xn k - z\\ x/Y = \\x nk -z- VkWx/Y < \\( X n k ~ Vk) ~ z\\ > 0 as k ^ 00 . 

Thus the Cauchy sequence (x n ) n>1 in X/Y has a subsequence that converges in 
X/Y. It follows that the sequence (x n ) n>1 itself converges in X/Y. □ 


Normed spaces form the natural abstract context for infinite series. Given a 
sequence (x n ) n ~^i of elements of a normed space X, we define the corresponding 

oo n 

series ^2 x n to be the sequence (s n ) n ^> i, where s n = ^ Xk is the nth partial sum 

n = 1 k = 1 

oo 

of the series. The series x n is said to be 

n— 1 

> convergent if the sequence (s n ) n ^ i converges to a limit s in X, called the sum 
of the series; 

OO 

> absolutely convergent if the series Yl ||®n|| is convergent in R. 

n — 1 


oo 

In the first case we write Y x n = s. 

n— 1 

If (a n ) n >i and (b n ) n>1 are sequences of real numbers such that 0 < a n ^ b n for 

OO OO 

each n, and if Y bn converges, then Y °n converges (the comparison test). For, 

n= 1 n= 1 

given £ > 0, since the partial sums of a convergent series form a Cauchy sequence 
we can find N such that 

k 

*<Ybn<e 

n=j 


k 

whenever k > j ^ N. For all such j,k we then have 0 < ^ a,j < e. So the partial 

n—j 

oo 

sums of Y a n form a Cauchy sequence, which converges by the completeness of M. 

n = 1 



2.3 Normed Linear Spaces 


57 


A particular case of this occurs when b n = r n for some fixed r with |r| < 1: in that 

OO OO 

case, b n is a geometric series, and a n converges to a sum at most . 

71—1 71 = 1 

We shall resume consideration of normed spaces in Chapter 4. 


Exercises 

1. Prove that two real numbers x and y are equal if and only if x U y is a real 
number. (This is not a typographical error: remember, a real number is actually 
a set.) 

2. Prove the following for real numbers x, y, z: 

x > y => x + z > y + z, 

(x > 0 A y > 0) => xy > 0, 

xy ^ 0 => (x yf 0 A y ^ 0) . 

3. Prove that for all x, y £ R. the sets max { x , y} and min {x, y} are real numbers; 
that min {x, y} ^ x ^ max {x, y} ; that x = min {x, y} if and only if x ^ y, and 
that x = max {x, y} if and only if x ^ y. Prove also that max \x. y} < e if and 
only if x < e and y < e; that max {x, y} ^ e if and only if x ^ e and y ^ e\ and 
that max {x, y} > z if and only if either x > z or y > z. 

4. Prove that \x\ = x if and only if x ^ 0, and that \x\ = — x if and only if x ^ 0. 
Prove also that \x\ < y if and only if —y < x < y. and that \x\ ^ y if and only 
if —y ^ x ^ y. 

5. Prove Archimedes’ axiom in the following form: If x > 0 and y ^ 0, then there 
exists an integer n such that nx > y. 

6. Let x , y be real numbers with y ^ 0. Prove that x/y is a real number and that 

y/y = 1 - 

7. Prove that the statement 

\/x £ M (-< (x = 0) => By £ K (xy = 1)) 
implies Markov’s principle. 

8. Prove that the mapping (x,y) x + y is a function on R x t, and that 

it is strongly extensional. Prove analogous results for the mappings x 
-x, (x,y) xy , (x,y) x/y (for y ± 0), x ^ |ar| , (x,y) min {x,y} , 

and (x, y) max { x , y} . 
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9. Give Brouwerian examples to show that each of the following statements is 
essentially nonconstructive: 

(a) If S C R is closed, then ~S is open in R. 

(b) If S' C R and ~S is open, then S is closed in R. 

10. Let X be a separable metric space. Prove that (i) dense subsets and (ii) located 
subsets of X are separable. 

11. Show that the statement “the metric complement of every located subset of R 
is located” implies the weak law of excluded middle (WLEM), ~^P V H-iP. 

12. Prove that the closure of a totally bounded subset of a metric space X is totally 
bounded, and that if a subset S' of AT contains a dense totally bounded set, then 
S itself is totally bounded. 

13. Prove Corollary 2.2.8 directly, without using Proposition 2.2.5, Proposition 
2.2.6, or Corollary 2.2.7. 

14. Let h be a mapping of a metric space into a totally bounded space AT such that 
/ o h is uniformly continuous for each uniformly continuous map / : X — ► R. 
Prove that h is uniformly continuous. 

15. Let ft, be a mapping of a compact metric space AT into a metric space Y such 
that / o ft is uniformly continuous on X for each uniformly continuous mapping 
f : Y — > R. Prove that ft is uniformly continuous if and only if its range is 
totally bounded. Prove also that if Y is locally compact, then ft is uniformly 
continuous. 

16. Let a < ft, and let S, T be inhabited open subsets of R such that [a, ft] C S U TT. 
Prove that S D T is inhabited. 

17. Let f : [a, ft] — ► R be sequentially continuous, with /(a) < /(ft). Prove that 
the range of / is dense in [/(a), /(ft)] . 

18. Let / be a bounded interval, and / : I — > R an increasing sequentially contin- 
uous function. Prove that / is uniformly continuous on /. 

19. Let S' be a dense subset of a metric space AT, and / a uniformly continuous 
mapping of S into a complete metric space Y. Prove that / has a uniformly 
continuous extension to a mapping of AT into Y ; that is, a uniformly continuous 
mapping F : AT — > Y such that F(x) = f(x) for all x £ S. 

20. Prove that the Hausdorff metric on the set /C(AT) of compact subsets of a com- 
pact metric space AT is indeed a metric, and that /C(AT) is complete with respect 
to it. Why do we need the sets to be compact in order to be sure that p defines 
a metric? 
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21. Let X be a compact metric space, JC(X) the compact metric space of all compact 
subsets of X (see the preceding exercise), and £ the set of all pairs (x, K) where 
K £ IC(X) and x £ K. Define a metric d on £ by 

d (( x , K) , (x\ K')) = p (x, x 1 ) + p (K, K ') , 

where the second p on the right-hand side denotes the Hausdorff metric on 
K,{X). Prove that £ is compact with respect to this metric d. 

22. Let X be a metric space, and {K n ) n>1 a decreasing sequence of compact subsets 
of X whose diameters converge to 0. Prove that f) K n is compact. Does the 

conclusion hold without the hypothesis that the diameters converge to 0? 

23. Under the hypotheses of Corollary 2.2.14, prove that if r £ R is admissible and 
e > 0, then there exists <5 > 0 such that for each admissible r' with |r — r'\ < S, 

p(X(f,r),X(f,r')) < e, 

where p is the Hausdorff metric on JC(X). 

24. Give a Brouwerian example to show that we cannot drop the condition “for all 
but countably many” from the conclusion of Theorem 2.2.13. 

25. Prove that the product of finitely many (locally) compact metric spaces is 
(locally) compact. 

26. Let / be a mapping from a metric space ( X , p) into R with the following prop- 
erties: 

(a) the set X(f,r) is compact for certain arbitrarily large real numbers r; 

(b) / is uniformly continuous on each of the sets X(f,r). 

Define a new metric p 0 on X by 

Po (x,x') = p(x,x') + |/ (x) -f(x ') | (x,x' £ X) . 

Prove that (X,p 0 ) is a locally compact metric space. 

27. Let S be a located subset of a locally compact metric space (Y,p) such that 
X = —S is inhabited. Let the mapping h : Y — > M be uniformly continuous 
on each bounded subset of Y, and such that the set Y(h,r ) is bounded for 
each r £ R. Prove that the mapping / : x h (x) + p (x, S') -1 on X satisfies 
conditions (a) and (b) of the preceding exercise. 

28. Let X be a uniformly convex Banach space, and u a normed linear functional 
on X. Prove that for any two distinct unit vectors x,y in X either |w(x)| < ||w|| 
or |u(y)| < ||u|| • (This is a strong form of the uniqueness result in Proposition 
2.3.7.) 
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29. Prove Proposition 2.3.1. 

30. Prove Lemma 2.3.2. 


Notes 


In view of our earlier remarks on the status of the power set, it may seem strange, if 
not perverse, to define a real number as a set. We do not believe that our definition 
will give rise to any logical problems, since in practice a real number is likely to 
be given explicitly by interval approximations, rather than by any appeal to the 
existence of the full power set of Q x Q. In fact, many, if not most, real numbers 
will actually arise as limits of sequences of rational approximations, from which it 
is straightforward to construct a set of the type required by our definition of “real 
number” . 

Bishop defined a real number to be a sequence (a?n) n>1 of rational numbers 
that is regular in the sense that \x m — x n \ < 1 /m+l/n for all m and n, two such 
sequences (x n ) n>1 and (y n ) n >i being called equal if \x n — y n \ < 2 jn for each n. 
For an axiomatic development of R see [19]. 

For us, =£ normally has a stronger meaning than the denial of equality. Some au- 
thors, notably the intuitionists, use ff instead of 7 ^ to denote an inequality relation, 
and 7 ^ to denote the denial inequality. 

Bishop originally used the word “continuous” to describe mappings that are 
uniformly continuous on compact sets. While this gets round the independence 
of the uniform continuity theorem relative to BISH, it has the disadvantage that 
we cannot prove that the composition of continuous functions is continuous, since 
the image of a compact set, although totally bounded, cannot generally be proved 
complete. (To be more precise, in RUSS there is a uniformly continuous mapping of 
[0, 1] onto (0, 1].) In a later, unpublished manuscript, The neat category of stratified 
spaces, Bishop introduced the notion of a compact image: a subset of a metric 
space that is the image of a compact metric space under a uniformly continuous 
mapping. He then defined a mapping / : X — > Y between metric spaces to be 
continuous if it is uniformly continuous near each compact image, in the following 
sense: for each compact image S C X and each e > 0 there exists 5 > 0 such 
that p(f(x),f( x')) < £ whenever x £ X, x' € S, and p(x,x') < S. Compositions of 
functions that are continuous in this sense are continuous; but the interval (0, 1 ] is 
a compact image in RUSS, so we cannot prove that the mapping x 1/x on (0, 1] 
is uniformly continuous near compact images. It seems that without Brouwer’s fan 
theorem (see Chapter 5 of [34]), it is impossible to come up with a definition of 
“continuous” in BISH, other than the usual one of “pointwise continuous”, that 
will satisfy all the conditions that one might wish for. For more on this, see [81] 
and [89]. 
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In this connection, Bishop’s statement that “The concept of a pointwise contin- 
uous function is not relevant” ([9], pages ix-x) no longer seems appropriate: there 
are instances — for example, in the theory of operators between Banach spaces — 
where sequential convergence is both useful and the best we can hope to prove 
within BISH; see Chapter 6. 

Some authors remove the word “inhabited” from the definitions of “located” and 
“totally bounded”, thereby allowing the empty set to be both located and totally 
bounded. For us, located sets and totally bounded sets are inhabited, by definition. 

Why do we define “compact” as we have done? Classically, compactness is de- 
fined in terms of the Heine-Borel-Lebesgue covering property: every open cover 
contains a finite subcover. In Brouwer’s intuitionistic mathematics, as classically, 
the interval [0, 1] has this open-cover compactness property; but Brouwer’s proof 
depends on his fan theorem, the addition of which to intuitionistic logic would lose 
us some of the flexibility of interpretation of our constructive mathematics. On the 
other hand, if we add the Church-Markov-Turing thesis to intuitionistic logic, then 
we can prove that [0, 1] does not have the Heine-Borel-Lebesgue covering property 
([34], page 60, Theorem (4.1)). So that property holds for [0, 1] in one model of our 
constructive mathematics but fails to hold in another. Therefore, without adding 
some principle to BISH, we cannot prove or disprove that it holds for [0, 1]. 

A second classical compactness property, equivalent to the Heine-Borel-Lebesgue 
one, is that of sequential compactness: every sequence in the space has a convergent 
subsequence. This fails even for the pair set {0, 1} in the constructive setting. Thus, 
in looking for a workable constructive notion of compactness for metric spaces, we 
are reduced to that of completeness plus total boundedness, which is classically 
equivalent to the other two considered above. Fortunately, completeness and total 
boundedness together make a combination sufficiently powerful for most construc- 
tive purposes. 

Proposition 2.2.11 appears in Aspects of Constructivism, the unpublished notes 
of colloquium lectures given by Bishop at New Mexico State University in December 
1972. It provides a neat proof of Corollary 2.2.12. The proof of Lemma 2.2.17 is 
new, although not unlike that of Lemma (4.10) on page 33 of [34]. 

Exercise 14 provides a constructive version of the classical theorem that the 
unique uniform structure IA compatible with the given topology on a compact Haus- 
dorff space X is induced by the W-uniformly continuous mappings of X into R. Note, 
however, that we require X to be only totally bounded, not compact. 

The notion of strong continuity is best studied in the context of an apartness 
space: that is, an inhabited set X with an inequality and a binary relation txi of 
apartness between sets that satisfies certain natural axioms. For more on apartness 
spaces see [38, 40]. 

Exercise 16 describes one version of the connectedness of the interval [a, b\. For 
other, constructively inequivalent, types of connectedness see [18]. 
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The result in Exercise 17 is perhaps the most general constructive intermediate 
value theorem. 

A linear map u : X — > Y between normed spaces is said to be compact if u{Bx) 
is a totally bounded subset of Y. Proposition 2.3.6 can be generalised as follows: A 
bounded linear mapping of a normed linear space onto a finite-dimensional Banach 
space is compact if and only if its kernel is located ([34], page 36, Theorem (5.4)). 

Bishop required that a Banach space be separable. We prefer not to depend on 
separability unless it is absolutely necessary. 

We use the term “normed”, rather than the usual “normable”, for those linear 
mappings for which the norm exists. 
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The A- Technique 


Here is more matter for a hot brain. 

— Shakespeare, The Winter’s Tale, act 5, scene 2 

In this chapter we discuss a peculiarly constructive technique that, normally under the 
hypothesis that one or more of the metric spaces under consideration is complete, enables 
us to prove results that otherwise would require some nonconstructive principle such as 
LPO, LLPO, or Markov’s principle. 


3.1 Introduction to the Technique 

We begin with another elementary classical result that does not hold in constructive 
mathematics: 

For each complex number z there exists 9 G [0, 27t) such that z = \z\ e l6 , and 
such that if 9 yf 0, then z=f=- 0. 

We show that this proposition entails LPO. To do so, we consider an increasing 
binary sequence (A„) n ^i with at most one term equal to 1. Define a sequence 
(z n )n^i °f complex numbers such that 

\ n = 0 v z n = 0, 

A n = 1 — A„_i Zk = — e 17r / 2 for all k ^ n. 

n 

Then \z m — z n \< l/n whenever m > n, so (z n ) n ^i is a Cauchy sequence and 
therefore converges to a limit z in the complete metric space C. Assume that 2 = 
\z\ e l8 for some 9 G [0, 27t), and that if 9 0, then z ^ 0. Either 9 < 7t/ 2 or 9 > 0. 

In the first case we have A„ = 0 for all n: for if we suppose that there exists n such 
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that X n = 1 — A„_i, then z = e 17T / 2 /n and therefore 9 = tt/2, a contradiction. In 
the case 9 > 0 we have z ^ 0, so there exists N such that Zn yf 0; then A„ = 1 for 
some n ^ N. 

The constructive problem occurs when 2 is near 0 but we cannot decide whether 
z = 0 or z / 0. Indeed, if 2 = 0 or z ^ 0, then we can find 9 G [0, 27t) such that 
z = \z\ e ie . 

Now, it might be thought that the failure of the modulus-argument decomposi- 
tion of a general complex number would mean that there was no constructive proof 
of the existence of square roots in C; for in order to find a square root of z, we nor- 
mally would write z = |z|e‘ e and then compute ± \J\z\e l6 ^ 2 . Although this method 
of finding a square root certainly will not work unless we already can decide that 
z = 0 or z / 0 — which, in general, we cannot — there is a constructive proof of the 
existence of square roots, one that uses the completeness of C. 

To see this, consider any complex number z, and note that for each positive 
integer n we have either \z\ < 1/n 2 or \z\ > l/(n+ l) 2 . Thus we can successively 
construct the terms of an increasing binary sequence (A ra ) n ^i such that 


A„ = 0 

An = 1 


* < 


* > 


(n+iy 


We may assume that Ai = 0. If X n = 0, set C, n = 0; if A n = 1 — A„_i, choose 
9 e [0, 27 t) such that z = \z\ e l8 , and set ( k = \J\z\e l9 ^ 2 for all k ^ n. Then (C n )n>i 
is a Cauchy sequence. To see this, let m ^ n. If \ m = 0 or A„ = 1, then ( rn = ( n . 
If X m = 1 and X n = 0, then there exists a unique k such that n < k ^ m and 
Afc = 1 — Afc_i; whence \z\ < 1 / (k — l) 2 and 

ic TO - a = ic fc - oi = ^ 


Thus, in all cases, |C m — Cnl < l/ n - Also, |^ 2 — z\ < 1/n 2 for each n. Since C is 
complete, (C n )n> l converges to a limit ( € C that clearly satisfies C 2 = z. 

This technique of “flagging” alternatives by the terms of a binary sequence 
(An) n ^i, constructing an appropriate Cauchy sequence, and using the limit of that 
sequence to circumvent our inability to make the sort of decision embodied in 
LPO, LLPO, or Markov’s principle, is quite common in constructive mathematics. 
We present in this chapter a number of results whose proofs are based on this A- 
technique. Our first such result is Bishop’s lemma, in whose original proof in [9] 
(page 177, Lemma 7) the A-technique first appeared. 


Proposition 3.1.1. Let S be a complete, located subset of a metric space X, and 
let x £ X. There exists s € S such that if x ^ s, then p(x, S) > 0. 
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Proof. Let Sq £ S. If p(x, S ) > 1/2, then we can take s = Sq. Hence we may assume 
that p{x, S) < 1. Now construct an increasing binary sequence (A n ) n>1 such that 
Xi = 0 and 


A ra = 0 =>• p(x,S) < 

n 

An = 1 => p(x, S) > 

n + 1 

If A„ = 0, pick s n £ S such that p(x,s n ) < 1/n. If A ra = 1, set s n = s n -i- Then 
p(s m ,s n ) < 2/n whenever m ^ n, so (s„) n>1 is a Cauchy sequence in S. Since S 
is complete, this Cauchy sequence converges to a limit s € S satisfying 

2 

p{s,s n ) < (n fz 1) • 
n 

If x ^ s, then we can compute a positive integer N such that p(x,s) > 3/N. If 
Xn = 0, then p(x,sn) < 1/-AT and so 

2 

p{s, s N ) > p(x, s ) - p(x, s N ) > — , 

a contradiction. Thus A^r = 1 and therefore p(x, S) > 0. □ 


Bishop’s lemma is simple to prove using classical logic. For if x € S, then by 
taking s = x we render the hypothesis of the implication “if i/s, then p(x, S) > 0 ” 
false; whereas if x S, then the conclusion of that implication holds since S is closed 
in X 1 so we may take s = Sq. This argument depends on the full law of excluded 
middle, but in fact only LPO is needed to establish Bishop’s lemma classically even 
when S is merely closed and not necessarily complete. To see this, first construct 
the sequence (A„) n>1 as in our constructive proof of the lemma. If A„ = 0 for all 
n, then p(x,S) = 0, so x £ S and we can take s = x; whereas if An = 1, then 
p(x, S) > 0 and we can take s = sq- 

In order to obtain a constructive proof of Bishop’s lemma, we replace LPO by 
the completeness of S. This is a typical situation, in which a result proved classically 
using an omniscience principle is proved constructively under some completeness 
condition using the A-technique. 

Now, it follows immediately from Bishop’s lemma that if S' is a complete, located 
subset of a metric space, then ~S = —S. The A-technique provides us with a related 
result in a Banach space, in which locatedness is replaced by convexity. 

A subset C of a linear space X is said to be 

> convex if tx + (1 — t) y £ C whenever x, y £ C and 0 ^ t ^ 1; 

> absorbing if for each x £ X there exists t > 0 such that x £ tC. 
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Proposition 3.1.2. If C is a convex, absorbing subset of a Banach space X , then 
~C is dense in -<C. 


Proof. Let x £ ->C, let e > 0, and choose S > 0 such that <5||x|| < e. Then 

x' = (1 + J)x fL (1 + £)<?. 


Given y £ C, we show that x' yf y. To that end, construct an increasing binary 
sequence (A n ) n>1 such that 


A ra = 0 => \\x' - y || < 
An — 1 x yf y. 


1 

(n + 1) 2 ’ 


We may assume that Ai = 0. Define a sequence (z„) n>1 in X as follows. If A„ = 0, 
set z n = 0; if A„ = 1 — A„_i, set Z]. = n(x'— y ) for all k ^ n. Since in the latter 
case we have 

II z n - z n - 1 || = n||x' - y || < 

n 

it follows that ||z m — z n \\ < 1/n whenever m > n. Hence {z n ) n>1 is a Cauchy 
sequence and so converges to a limit 2 in the complete space X. Choose a positive 
integer N such that 2 G NSC , and consider any integer n > N. If A n = 1 — A n -i, 
then z = n(x' — y); whence 

x — y T — z £ y -\- — C £ C SC — (IT 
n n 

a contradiction. Thus X n = \ n -i for all n ^ N. It follows that if Xn = 0, then 
A n = 0 for all n, and therefore x' = y G C C (1 + S)C. This contradiction ensures 
that Ajv = 1; whence x' ^ y. Since y £ C is arbitrary, it follows that rJ G ~C. 
Since also ||x' — x|| = <5||x|| < e, and x £ ->C and £ > 0 are arbitrary, we conclude 
that is dense in ~>C. □ 


Classically, as the reader is invited to prove, the completeness of the space X 
can be removed from Proposition 3.1.2 since we are allowed to use LPO. 

For an example of the use of the A-technique to avoid the application of Markov’s 
principle, we turn to an elementary result in metric topology. We observed in Chap- 
ter 2 that it cannot be proved constructively that the union of two closed subsets of 
a metric space is closed. Nevertheless, it seems reasonable to expect that if A, B are 
closed subsets of R, then the complement, in some appropriate sense, of A U B will 
be the intersection of the complements of A and B. Since it is trivial that the first 
of these complements is a subset of the intersection of the other two, it is enough 
to prove the reverse inclusion. With the help of Markov’s principle, we can prove 
that 
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(-u4n -cB) fliUB = 0. 

Indeed, supposing that x € {~^A n ~^B) n A U B, choose a sequence ( x n ) n>1 of 
elements of A U B converging to x. Define an increasing binary sequence (A„) >1 
such that 

A„ = 0 => Vfc < n (. Xk £ A ) , 

A n 1 A „_1 )' x n £ B. 

We may assume that Ai = 0. If A„ = 0 for all n, then = A, a contradiction. 

Hence, by Markov’s principle, there exists n > 1 such that X n = 1 — A„_i, so there 
exists ni > 1 with x ni £ B. Repeated application of this argument enables us 
to construct a strictly increasing sequence {nk)k>i °f positive integers such that 
x nk £ A for all even k, and x nk £ B for all odd k. Thus x £ A n B, which is a final 
contradiction. 

Since such an argument, depending on Markov’s principle, is not good enough 
for our constructive purposes, it is fortunate that if we work within a complete 
metric space and use the A-technique (on more than one occasion), we can say 
something interesting about the intersection of the complements of two closed sets. 
This will require a preliminary result. 

Lemma 3.1.3. Let X be a complete metric space, A a closed subset of X, and B 
a subset of X. Let x £ let y £ A\J B, and let (y n ) n >i be a sequence in AU B 
that converges to y. Then either x ^ y or there exists n such that y n £ B. 


Proof. We may assume that y\ £ A and, by passing to a subsequence if necessary, 
that p(y n ,y) < 1 /n for each n. Construct an increasing binary sequence (A n ) n>1 
such that 


A„ = 0 => VA: < n ( y k £ A ) , 

An — 1 A n — 1 ■ > ' Pn B. 

If A„ = 0, set £n = Vn\ if A„ = 1 - A„_ 1 , set = ^n-l f° r ^ n - Tlien (O n >l 

is a Cauchy sequence in A; in fact, 

2 

P (£m> O < P (£m> y) + P (£n> V) < ~ {™> > n) . 

n 

Since X is complete and A is closed, (£ n )n>i converges to a limit (tA. Choose 
a positive integer N such that p(x,f) > 2/N. If A at = 1, then y n £ B for some 
n ^ N. If Xn = 0, then either x ^ y or, as we may suppose, p (x, y) < 1/N. If there 
exists m^N such that A m+ i = 1 — X m , then 


p{x,y) ^ p{x,£) - p {£, y) 
> -p(ym,y) > 


2 11 
N~ to ^ N’ 


a contradiction. Hence A n = 0 for all n N and therefore for all n: so y n £ A for 
all n. Since A is closed, y £ A and therefore x yf y. □ 
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Proposition 3.1.4. If A,B are closed subsets of a complete metric space X , then 

~A n = ~iUB. 


Proof. It is clear that UB C ~Al n ~£?. To prove the reverse inclusion, given 
x G n and y £ AU B, we must prove that x ^ y. To this end, choose 
a sequence (j/n) ra>1 in AUB that converges to y. We may assume that yi £ A 
and that p (y n , y) < 1/n for each n. Set Ai = 0, ni = 1, and ^ = y\. In view of 
Lemma 3.1.3, we may also assume that there exists ri2 > 1 such that y n , 2 £ B\ set 
A2 = 0 and pi = y n2 . Now apply the same lemma, but with the roles of A and B 
interchanged. Either we have x ^ y, when for each k ^ 3 we set A k = 1, n k = ri2, 
f k -i = £1, and p k _i = Pi, or else, as we may assume, there exists n 3 > n 2 such that 
y n 3 £ A , in which case we set A3 = 0 and = Hn 3 - Repeating such applications 
of Lemma 3.1.3, we construct an increasing binary sequence (A n ) >1 , an increasing 
sequence (rifc) fe>1 of positive integers, a sequence (f, k )k>i of elements of A , and a 
sequence (p k ) k>1 of elements of B , such that for each k ^ 1, 

• if A 2 fc = 0, then n 2k > n 2 k - 1 and p k = y n2k £ B\ 

• if A 2 fc+i = 0, then n 2k+ 1 > n 2k and £ fc+1 = y n2k+1 G A; 

• if A 2 k = 1 — A 2 fe_i, then x ^ y and for each j > k, n 2j+ 1 = n 2j = n 2 j_i, = £ fc , 
and rjj = p k _ 1; 

• if A 2fc +i = 1- A 2 fc, then x ± y and for each j ^ k, n 2j+2 = n 2j+ 1 = n 2 fc, = £ fc , 
and pj = t) k . 

Then is a Cauchy sequence in A, and ( V k )k>i a Cauchy sequence in B. 

Indeed, for m)»we have 

2 

n 

and similarly p (p m , p n ) < 2 /n. Since X is complete and the subsets A, B are closed 
in X, the sequences (£„) n>1 , ( r l n )n>i converge to respective limits £, £ A and p £ B. 
Choose an integer N > 4 such that p(x,£) > 3/N and p(x,p) > 3 /N. If X N = 1, 
then x ^ y; so we may assume that A at = 0. Either x ^ y or, as we may further 
assume, p (x, y) < 1/N. Suppose there exists m> N such that A m = 1 — A m _i. In 
the case where in is even, we have 

p(x,y) fz p{x,p) -p(p,y) 

> - p(yn m - 2 ,y ) 

3 1 

> N - m — 2 

3 2 

N m 


(since m > 4) 
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a contradiction. If m is odd, a similar argument again leads us to a contradiction. 
Hence A n = 0 for all n ^ N and therefore for all n. It follows that y = £ = y and 
therefore p{x,y) > 3 /N, a contradiction. Hence the case p{x,y) < 1/N is ruled 
out, and so p{x,y) ^ 1/N. □ 


The A-technique turns out to be very useful for clarifying the connections be- 
tween various continuity properties of functions between metric spaces. 

It is trivial that a continuous mapping between metric spaces is strongly ex- 
tensional. If the domain is complete, then we can weaken continuity to sequential 
continuity. 

Proposition 3.1.5. Let f be a sequentially continuous mapping of a complete met- 
ric space X into a metric space Y. Then f is strongly extensional. 

Proof. Given points x, y of X with f(x) ^ f(y), construct an increasing binary 
sequence (A„) n ^i such that 


A n = 0 => p(x, y) < — — , 
n + 1 

An = 1 > ' x y ^ y. 

Define a sequence (z n ) n ^i in X as follows: if A„ = 0, set z n = x: if A n = 1 — A„_i, set 
Zk = y for all k ^ n. Then (z n ) n ^i is a Cauchy sequence and so converges to a limit 
2 in X. Since / is sequentially continuous, the sequence (f(z n )) n ^ i converges to the 
limit f(z) in Y. Choose a positive integer N such that p(f(z n ),f(z)) < p(f(x),f(y)) 
for all n ^ N. If A„+i = 1 — A n for some n ^ N, then z = y and z n = x, so 

P(f(x),f(y)) = p(f(z n ),f(z)) < p(f(x),f(y)), 

a contradiction. Hence A„+i = A„ for all n ^ N. It follows that if An = 0, then 
A n = 0 for all n; whence x = y, and so /( x) = f(y). This contradiction ensures 
that Ajv = 1. □ 


For a linear mapping on a Banach space we obtain strong extensionality without 
even sequential continuity, as a corollary of the following application of the A- 
technique. 

Proposition 3.1.6. Let T be a linear mapping of a Banach space X into a normed 
space Y. Let B be a subset of the graph of T that is closed and located in X x Y, 
and let (x,y) £ X xY be such that y ^ Tx. Then p((x,y),B) > 0. 

Proof. Construct an increasing binary sequence (An ) n>1 such that 
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A„ = 0 ==> p ((x, y) , B) < — 2 , 

(n+ i) 

A„ = 1 => p ((x, y),B)> 0. 

We may assume that Ai = 0. If A„ = 0, set z n = 0; if A„ = 1 — A„_i, pick (el 
such that 

(& TO GBaud ||*-0l + lltf-T0l<^ (3-1) 

n z 

and set Zi = n (x — 0 for all i > n. If j > k and || Zj — Zk || > 0, then A& = 0 and 
\ n = 1 — A„_i for a unique value of n with k < n ^ j; so there exists (eX such 
that (3.1) holds and 

I \zj ~ Zk\\ = \\n (x — 0 II < \ 

Tl rC 

Hence \\zj — Zk\\ < 1/fc whenever j ^ k. Therefore (z n ) n>1 is a Cauchy sequence in 
X and so converges to a limit z € X. Now choose a positive integer N such that 

l+\\Tz\\^N\\y-Tx\\ 

and consider any integer n > N. If A„ = 1 — A„_i, then z = n(x — £) for some £ 
satisfying (3.1), so 

1+ ||T*|| = l + n||Tx^T£|| 

> 1 + n ||Tx — y\\ — n\\y — T^|| 

>l + N\\Tx-y\\ - - 

Tl 

> N \\Tx — y \\ , 

a contradiction of our choice of N. Hence A ra = A„_i for all n ^ N. It follows that 
if Xn = 0, then A„ = 0 for all n, and therefore p ((x, y ) , B) = 0. Since B is closed 
in X x Y, we now have y = Tx, which contradicts our original hypotheses. We 
conclude that Ajv = 1 . □ 

Corollary 3.1.7. A linear mapping of a Banach space into a normed space is 
strongly extensional. 

Proof. Let T be a linear mapping of a Banach space X into a normed space Y, and 
let x £ X satisfy Tx ^ 0. Given 2 G ker(T), apply Proposition 3.1.6 with y = 0 
and B = {( z,Tz )} , to obtain (x, 0) ^ ( z,Tz)\ whence either i/z or else Tz 7 ^ 0. 
Since the latter is absurd, we conclude that x ^ z. □ 


Markov’s principle is all that we need add to intuitionistic logic in order to prove 
Proposition 3.1.6 without the hypothesis that X is complete: for the argument 
towards the end of the proof of that proposition shows that it is impossible that 
p((x,y ) ,B) = 0, so, by Markov’s principle, p((x,y) ,B) >0. In fact (this is left 
to the exercises at the end of the chapter), if Proposition 3.1.6 holds without the 
completeness of X , then we can derive Markov’s principle. 



3.2 Ishihara’s Tricks 


71 


3.2 Ishihara’s Tricks 


In [57], Ishihara introduced the following two lemmas, now called Ishihara’s tricks , 
in which we use completeness to make a decision which at first sight would seem to 
be impossible with purely constructive techniques. 

Lemma 3.2.1. Let f be a strongly extensional mapping of a complete metric space 
X into a metric space Y, and let (x n ) n>1 be a sequence in X converging to a limit 
x. Then for all positive numbers a, j3 with a < (3, either p(f(x n ), f(x)) > a for 
some n or p(f(x n ), f(x)) < (3 for all n. 

Proof. Let ( n k ) k>1 be a strictly increasing sequence of integers such that p(x, x n ) < 
1/ (k + 1) for all n ^ rife. For convenience, set no = 1. For each k ^ 1 set 

Sfc = ma x{p(f(x n ), f(x)) : n k - i ^ n < n k }- 

Construct an increasing binary sequence (Afc) fc>1 such that 

Afc = 0 =t- Vj < k ( Sj < (3 ) , 

Afe = 1 => 3 j < k ( Sj > a) . 

We may assume that Ai = 0. Define a sequence {yk) k >i i n X as follows. If Afc = 0, set 
Uk = x ; if Afc = 1 - Afc_i, choose v k with n fc _ i < v k < n k and p(f(x Vk ),f(x)) > a, 
and set yj = x„ k for all j ^ k. Let i ^ j. If A, = 0 or A j = 1, then y, = yj. If 
A* = 1 — Xj, then there exists k with j < k ^ i and Afc = 1 — Afc_i; so yj = x, and 
Vi = yk = x Uk for some v k such that n k - i < v k < n k and p(f(x Uk ),f(x)) > a\ 
whence 

, . , . 1 1 

p{yi,yj) = p(x„ k ,x) ^ — — < -. 

k'k + 1 J 

It follows that p(yi,yj) < 1 / j whenever i ^ j; so (■ y k ) k >i a Cauchy sequence and 
therefore converges to a limit y in X. Either p(f(x ), /((/)) < a or p(f(x), f(y)) > 0. 
In the first case, if Afc = 1 — Afc_i, then y = x Vk with p(f(x Vk ), f(x)) > a, a contra- 
diction; whence X k = 0 for all k. Then p{f{x n ), f(x)) < /3 for all n. In the second 
case, since / is strongly extensional, x ^ y. Choose a positive integer k such that 
x ^ y K . If A k = 0, then x ^ y K = x, a contradiction; whence A K = 1 and there 
exists n such that p(f{x n ),f(x)) > a. □ 


Let P(n) be a property of positive integers n. We say that P(n) holds 

• eventually , or for all sufficiently large n, if there exists N such that P(n) holds 
for all n ^ N: 

• infinitely often if for each n there exists m > n such that P(m ) holds. 



72 


3 The A-Technique 


The foregoing lemma leads to a technique allowing us to decide between alternatives 
that happen eventually and those that happen infinitely often. 

Lemma 3.2.2. (Ishihara’s second trick) Let f be a strongly extensional map- 
ping of a complete metric space X into a metric space Y, and let ( x n ) n>1 be a 
sequence in X converging to a limit x. Then for all positive numbers a, (3 with 
a < (3, either p(f(x n ),f(x)) > a for infinitely many n or else p(f(x n ),f(x)) < (3 
for all sufficiently large n. 

Proof. Let (n k ) k>1 be a strictly increasing sequence of positive integers such that 
p(x,x n ) < 1/fc for all n ^ n k . Successively applying Lemma 3.2.1 to the subsequence 
( x n) n > nk > construct an increasing binary sequence (A fc) fc>1 such that 

A k = 0 => 3 n ^ n k (p(f(x n ),f(x)) > a ) , 

Afc = 1 => Vn ^ n k (p(f(x n ),f( x)) < (3) . 

We may assume that \\ = 0. Define a sequence (■ Vk) k >i in X as follows. If \ k = 0, 
choose z/fc ^ n k such that p{f{x Vk ), f(x)) > a, and set y k = x Vk \ if \ k = 1— Afc_i, set 
yi = y k - 1 for all i ^ k. Then ( Vk) k >i is a Cauchy sequence: in fact, p(yi, Vj) < 2 /j 
whenever i j. Let y be the limit of (yk) k >i in the complete space X. Either 
0 < p{f{x),f(y)) or p(f(x),f(y)) < a. In the first case, since / is strongly exten- 
sional, x yf y, so there exists a positive integer k with p{x, y K ) > 1/k. If A re = 0, then 
y K = x„ K for some u K n K ; whence p(x,y K ) < 1 /k, a contradiction. Thus A^ = 1, 
and therefore p (f(x n ),f(x)) < (3 eventually. In the case p(f(x),f(y)) < a , if there 
exists k such that A^+i = 1 — Afc, then y = x Vk and p{f{x Vk ), f(x)) > a, a contra- 
diction; whence Afc = 0 for all k, and therefore p(f(x n ), f(x)) > a infinitely often. □ 


A mapping / : X — > Y between metric spaces is said to be sequentially nondis- 
continuous if it has the following property: if (x n ) n>1 converges to x € X and 
p(f (x n ) ,f(x)) S for all n, then <5^0. Clearly, a sequentially continuous map- 
ping is sequentially nondiscontinuous. Ishihara’s first application of his tricks was 
to the converse. 

Proposition 3.2.3. Let f be a mapping of a complete metric space X into a metric 
space Y. Then f is sequentially continuous if and only if it is strongly extensional 
and sequentially nondiscontinuous. 

Proof. Suppose that f is strongly extensional and sequentially nondiscontinuous. 
Let (x n ) n >i be a sequence converging to x € X, and let e > 0. By Lemma 3.2.2, 
either there exists a subsequence ( x nk ) k>1 of (x n ) n>1 such that p(f(x nk ),f(x)) > 
e/2 for all k, or else p (f{x n ), f(x)) < e for all sufficiently large n. In the former 
case, the sequential nondiscontinuity of / shows that £ ^ 0, which is absurd. We 
conclude that the latter case obtains. Hence / is sequentially continuous. 
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The converse is an immediate consequence of Proposition 3.1.5 and the obser- 
vation made before this proposition. □ 


Corollary 3.2.4. For linear mappings of a Banach space into a normed space, 
sequential continuity and sequential nondiscontinuity are equivalent. 

Proof. This follows immediately from Corollary 3.1.7 and Proposition 3.2.3. □ 


We now lift Ishihara’s tricks into a general setting. This both clarifies the ideas 
underlying those lemmas and raises the possibility that some other applications of 
their proof techniques in constructive analysis are, in fact, corollaries of our general 
results. We begin with a generalisation of Ishihara’s first trick (Lemma 3.2.1). 

Proposition 3.2.5. Let X he a complete metric space, let P, Q be subsets of X 
such that X = PUQ, and let x be an element of X such that for each y £ X, either 
x ^ y or y Q. Then for each sequence (x n ) n>1 in X that converges to x, either 
x n € P for all n or else there exists N such that xn € Q. 

Proof. Construct an increasing binary sequence (A ra ) n>1 such that 

A„ = 0 => Vfc < n (, Xk £ P ) , 

A n — 1 A n—1 k X n £ Q. 

We may assume that Ai = 0. If A„ = 0, set y n = x\ if A n = 1 — A„_i, set yk = x n for 
all k ^ n. To see that (y n ) n >i is a Cauchy sequence in X , let £ > 0 and compute N 
such that p(x, x n ) < e for all n ^ N. Let m ^ n ^ N. If A m = 0, then y m = y n = x. 
So we may assume that X m = 1. If X n = 0, then 

p{y m ,Vn) = p(x m ,x) < e. 

If A n = 1, then 

Piljrni ]Jn) — P^mi-^n) ^ p(.X, X rn ) p{.X,X r f) <t 2£\ 

Thus in all cases, p(y m ,yn) < 2e. 

Since X is complete, the Cauchy sequence (y n ) n> 1 converges to a limit y £ X. 
Either x ^ y or y £ Q. In the first case, choosing N such that x ^ y^, we see 
that Ajv = 1- In the second case, if there exists m such that A m = 1 — A m -i, then 
y = x m £ Q, a. contradiction; whence A„ = 0, and therefore x n £ P, for all n. □ 


To derive Lemma 3.2.1, assume the hypotheses of that lemma, define 

P={y £ X : p(f{y),f{x)) < (3} , 

Q = {y £ X : p{f{y),f{x)) > a} , 


and apply Proposition 3.2.5, noting that for all y € Y, 
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• either p(f(y), f(x)) > 0 , in which case, by the strong extensionality of /, x ^ y\ 

• or else p(f(y),f(x)) < a and therefore y ^ Q. 

In order to tackle the generalisation of Ishihara’s second trick (Lemma 3.2.2), 
we prove two lemmas, the first of which is a variant of Proposition 3.2.5. 

Lemma 3.2.6. Let X be a complete metric space, let P , Q be subsets of X such 
that X = P U Q, and let x be a point of X such that for each y £ X, either x ^ y 
or y (f P. Then for each sequence ( x n ) n> i in X that converges to x, either x n £ P 
for all n or else there exists N such that Xn £ Q. 

Proof. Construct an increasing binary sequence (A„) n>1 such that 

X n = 0 => VA; < n (. Xk £ P ) , 

A„ = 1 A n _i v x n £ Q. 

We may assume that Xi = 0. If X n = 0, set y n = x; if X n = 1 — A„_i, set y k = x n _\ 
for each k ^ n. Then (y n ) n>1 is a Cauchy sequence in X and so converges to a 
limit y £ X. Either x ^ y or else y P. In the first case choose N such that 
p{x n -i,y) > \ p(x,y ) for all n ^ N, and suppose that Xn = 0. If A m = 1 — A m _i 
for some m > N, then y = x m -i and so 0 = p(x m -i,y) > | p{x,y ), a contra- 
diction. Hence X n = 0 for all n ^ N and therefore for all n; but this implies that 
y = x, another contradiction. Thus, in fact, Xn = 1 and there exists n ^ N such 
that x n £ Q. On the other hand, in the case y ^ P we must have X n = 0, and 
therefore x n £ P , for all n. □ 


Lemma 3.2.7. Let X be a complete metric space, let P, Q be subsets of X such that 
X = P U Q, and let x be an element of X. Suppose that for any sequence {x n ) n>1 
converging to x in X, either there exists N such that x Xn or else x n Q for all 
n. Then for each y £ X, either x ^ y or y £ Q. 

Proof. Applying the hypotheses to the sequence (x, x , . . .) in X, we see that x ^ Q. 
Given y in X, construct a (perforce increasing) binary sequence (A ra ) n>1 such that 

An = 0 => p(x,y) < 

n 

X n = 1 =>• P (x, y) > — — • 
n + 1 

We may assume that Ai = 0. If X n = 0, set y n = x; if A ra = 1 — A n -i, set y n = y and 
yk = x for each k ^ n + 1. It is easily shown that the sequence (y n ) n > i converges 
to x: so either there exists N such that x ^ yN — y, or else y n ^ Q for each n. In 
the latter case, supposing that y £ Q, we see that A ra = 0 for all n; whence y = x, 
which is impossible, since we have already shown that x Q. We conclude that 

viQ- ' □ 
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This brings us to the generalisation of Ishihara’s second trick (Lemma 3.2.2). 

Proposition 3.2.8. Let X be a complete metric space, let P,Q be subsets of X 
such that X = P U Q, and let x be an element of X. Suppose that for any sequence 
(x n ) n >i converging to x in X, either there exists N such that x ^ Xn or else x n Q 
for all n. Then for any such sequence, either x n £ P eventually or else x n £ Q 
infinitely often. 

Proof. Consider the space X of all sequences that converge to x in X, taken with 
the metric 

Pit, V) = smpp(i n ,r) n ), 

n^l 

where, for example, £ denotes the sequence (£ 1; £ 2j • • •) in X. An argument like the 
standard one for proving the completeness of the classical sequence space (and 
left as an exercise) shows that X is complete. Define 

P = {x G X : 3n ( x n £ Q)} , 

Q = {x £ X : Mn ( x n £ P)} . 

We see from our hypotheses and Lemma 3.2.7 that for each y £ X, either x ^ y or 
y Q. It follows from Proposition 3.2.5 that for any sequence x in X that converges 
to x, either there exists n such that x n £ Q or else x n £ P for all n; whence either 
x £ P or x £ Q. Let x°° = (x, x,x, . . .) , and for each n let x n = (x n , x n +\, . . .) . 
Then x n — > x°° in X. Since our hypotheses also ensure that for each y £ X, either 
x°° ^ y or y ^ P, we can apply Lemma 3.2.6 in the space X to show that either 
x" £ P for each n or else there exists N such that x jV £ Q. This, when unwrapped, 
is precisely the conclusion we want. □ 


It is relatively simple to prove Lemma 3.2.2 using Proposition 3.2.8. Under the 
hypotheses of that lemma, take 

p = {y £ X : p if{x),f(y)) < (3} , 

Q = {y £ X : p (f(x),f(y)) > a} , 


and, for convenience, 


p' = {y £ X : P ( f(x),f(y )) < a} , 

Q' = {y £ x : p ( f{x),f{y )) > a/2} . 

Then X = PU Q and for each y £ X, either x y or y Q. Similarly, X = P' U Q' 
and for each y £ X, either x y or y </ Q' . It follows from the last statement 
and Proposition 3.2.5 that if (j/n) n>1 is any sequence converging to x in X, then 
either y n £ P' for all n or else there exists N such that yw £ Q' ■ Hence either 
y n £Q for all n or else there exists N such that x yt y N . Thus P and Q satisfy the 
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hypotheses of Proposition 3.2.8, from which we immediately deduce the conclusion 
of Ishihara’s second trick. 

We have now given a number of illustrations of the merit of the A-technique as 
a replacement for omniscience principles. The technique will reappear throughout 
the book, where it will be used, for example, to construct best approximations from 
finite-dimensional spaces, to locate certain convex sets in a normed space, and to 
discuss the range of an operator with an adjoint on a Hilbert space. 


Exercises 

1. Using the A-technique, show that if Mo = {ax : x € M} is closed, then a = 0 or 
s / 0. 

2. Let X be a complete metric space, and / : X — > M a sequentially continuous 
mapping such that inf / exists. Suppose that to each e > 0 there corresponds 
<5 > 0 such that if x,y € X and max{f(x),f(y)} < S, then p(x,y) < e. Prove 
that there exists a € X such that if f(a) > 0, then inf / > 0. 

3. A linear mapping T : X — > Y between normed spaces is said to be well-behaved 
if Tx ^ 0 whenever x G ~ker(T). Prove that the statement “every linear 
mapping between normed spaces is well-behaved” is equivalent to Markov’s 
principle. Prove also that every linear mapping from a Banach space into a 
normed space is well-behaved. 

4. Prove that if Corollary 3.1.7 (and hence a fortiori Proposition 3.1.6) holds with- 
out the hypothesis that X is complete, then we can derive Markov’s principle. 

5. Prove de Morgan’s rule for metric complements : If (S n ) n>1 is a sequence of 
located subsets of a metric space X such that S = (J S n is complete and 

n^l 

located, then — S = f) — S n . 

6. A subset S' of a metric space X is said to be uniformly almost located if there 
exists a strictly decreasing sequence (^n) ra>1 of positive numbers converging to 
0 such that the following holds: for each x £ X there exists y £ S such that for 
each ?x, if p{x,y) > S n , then p(x,S) > <5„+i. Prove that if S is an inhabited, 
uniformly almost located subset of a locally totally bounded space, then S is 
located. 

7. Let A be a complete metric space, and a a point of X such that 

s = sup { p(a , x) : x £ X} 

exists. Prove that for each r > 1 there exists b £ X such that s ^ rp(a, b). 
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8. Let T be a sequentially continuous linear mapping of a Banach space X into a 
normed space Y, and let B be the unit ball of the range of T. Suppose that for 
each e > 0, either there exists x € ~T~ l (B) with ||x|| < e or else ~T _1 (i?) is 
bounded away from 0 (that is, there exists 6 > 0 such that ||x|| > 5 whenever 
T(x) €E B). Prove that T is a bounded linear mapping. 

9. Let T : X — > Y be a sequentially continuous linear map between normed 
spaces such that ker(T) is located. Prove that if xo € X and T(x o) 0, then 
p(xo, ker(T)) > 0. 

10. Let X be a metric space, and let S, T be subsets of X such that for each s € S, 
each t € T, and each e > 0, either s ^ t or there exists y £ S(lT with p(t,y) < e. 
Prove that if S n T is complete, then S and T intersect sharply in the following 
sense: if x € ~ (S fl T), then for each s £ S and each t € T, either a; / s or 

i/(. 

11. A sequence (x n ) n>1 in a metric space is said to be weakly discriminating if for 
all positive a, b with a < b, either p ( x n , X\) < b for all n or else p(x n , aq) > a for 
some n. Prove that every totally bounded sequence in a metric space is weakly 
discriminating. 

Let / : X — > Y be a function between metric spaces that maps Cauchy 
sequences to weakly discriminating sequences. Prove that / is strongly exten- 
sional. 

12. A sequence ( x n ) n>1 in a metric space is called an LEM-Cauchy sequence if 

({x n ) n ^ 1 is a Cauchy sequence^ . 

Prove that the following are equivalent conditions on a mapping / between 
metric spaces: 

(a) / maps convergent sequences to weakly discriminating LEM-Cauchy se- 
quences. 

(b) / is sequentially continuous. 

13. Let / be a mapping of a complete metric space X into a metric space Y. Suppose 
that / is strongly extensional and that it is sequentially discontinuous at some 
point x £ X in the following sense: there exist £ > 0 and a sequence (x n ) n>1 
converging to x such that p(f(x),f(x n )) > £ for each n. Prove that LPO holds. 

14. Let / be a strongly extensional mapping of a complete metric space X into a 
compact metric space Y, and let {x n ) n>1 be a sequence converging to x in X. 
Prove that the sequence (f{x n )) n>1 has a convergent subsequence. (Hint: First 
prove that if LPO holds, then every sequence in a compact metric space has a 
convergent subsequence. Then use Ishihara’s tricks and Exercise 13.) 
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15. Let (U n ) n ^i be a sequence of located open sets in a complete metric space X 
such that the metric complement of U = f] U n is inhabited. Prove that there 

n^l 

exist a point Xao in —U and an increasing binary sequence (A„)n^i such that 
for each n, if X„ = 0, then Xao £ U n . and if X n = 1, then — Uk is inhabited for 
some k ^ n. 

Use this result to prove that Markov’s principle is equivalent to the following 
statement (which is classically equivalent to Baire’s theorem): If (U n ) n>1 is 
a sequence of located open subsets of a complete metric space X such that 
— f) U n is inhabited, then there exists n such that — U n is inhabited. 

n^l 


Notes 

A variation of the argument at the beginning of the chapter shows that if for each 
z £ C, there exists 8 € [0, 2n) such that z = \z\ e lS , then LLPO holds. 

The classical proposition “sequential continuity implies pointwise continuity” is 
equivalent to the essentially nonconstructive principle BD-N that we discuss later, 
in Section 6.3. 

When taken with the Church-Markov-Turing thesis, Exercise 13 shows that it 
is impossible that there exist a sequentially discontinuous function from R to R, 
since LPO is false in RUSS. It is, however, a far cry from showing that the existence 
of a sequentially discontinuous mapping on R implies LPO to proving that every 
mapping / : R — > R is pointwise continuous. The latter can be done with the aid 
of either Brouwer’s continuity principle or else both the Church-Markov-Turing 
thesis and Markov’s principle; see Chapters 3 and 5 of [34]. 

In connection with Exercise 1, Fred Richman has shown us the following choice- 
free proof that if Ra is complete, then either a = 0 or a / 0. Assume that Ra is 
complete. For each £ > 0 we have either i/fo| < £ or else |a| > 0; in the latter case, 

a/H = ±“7= € Ra - 

vM 

Since £ > 0 is arbitrary, y/ ja[ € Ra = Ra and there exists r such that -y/j o| = ra. 
Pick a positive integer N > r. Either |a| > 0 or |a| < 1/N 2 . In the latter case, if 
a/0, then 

\r\ |a| = \ra\ = y/\a\ = J^L= > N |a| , 

vM 

so |r| > N, a contradiction; whence a = 0. 

Exercise 3 and an extended version of Proposition 3.1.6 originated in [24]. 
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Exercise 6 deals with a weak converse of Bishop’s lemma. There seems to be 
no obvious use for such converses, in spite of the extreme value of Bishop’s lemma 
itself. 

The notion of “weakly discriminating”, and Exercises 11 and 12, come from [37]. 

Exercise 14 produces a constructive substitute for the highly nonconstructive 
Bolzano- Weierstrass theorem; it first appeared in [30] . For more constructive analy- 
ses of the Bolzano- Weierstrass theorem see [27] and [22]. 

For more on Baire’s theorem, see [34] (Chapter 2) and Chapter 6 below. 
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Finite-Dimensional and Hilbert Spaces 


A mind that is stretched by a new experience can never go back to its old dimen- 
sions. 

— Oliver Wendell Holmes, attrib. 


We first examine finite-dimensional spaces, including an application of the A-technique to 
the problem of finding best approximations by elements of a finite-dimensional subspace. 
We then introduce Hilbert spaces, which are natural generalisations of finite-dimensional 
Euclidean spaces. 


4.1 Finite-Dimensional Spaces 

Let X be a linear space equipped with a compatible inequality. Finitely many 

n 

vectors ei, . . . ,e n in X are said to be linearly independent if A ^ 0 for all 

2=1 

n 

scalars Ai, . . . , A n such that ^ |A*| > 0. In that case, if the A* are scalars such that 

2=1 

n 

Y A ,ej = 0, then A; = 0 for each i. We say that the space X is finite- dimensional 
2=1 

if either X = {0} or else it contains finitely many linearly independent vectors 
e l7 ... ,e n such that for each x £ X there exist scalars Ai, . . . , A n for which 

n 

x = A (4.1) 

2=1 

In the first case we say that X has dimension 0 or is 0-dimensional. In the second 
case we say that X has dimension n or is n- dimensional, we call {ei, . . . ,e n } a 
basis of/for X , and we say that the space X is spanned by, or is the (linear) span 
of, the set {ei, . . . , e n j . We denote the dimension of a finite-dimensional space X 
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by dim(AT). Since any two bases of X have the same number of vectors, dim(A) is 
well defined. 

If {ei, . . . , e n } is a basis for X, then for each x € X, the coordinates A,; in the 
representation (4.1) are uniquely defined; so there are well-defined, clearly linear, 

n 

coordinate functionals Ui : X — > K such that x = Y Ui(x)ei for each x in X. We 

i = 1 

shall prove that each of the coordinate functionals is a bounded linear map relative 
to any norm on X. 

Lemma 4.1.1. Let e be a nonzero vector in a normed space X. Then the subspace 
Y = Ke is locally compact, and = —Y. 

Proof. The mapping A A ||e|| 1 e is an isometric isomorphism of K onto Y, so Y 
is locally compact and therefore located. Since locally compact spaces are complete, 
it follows from Bishop’s lemma (Proposition 3.1.1) that = —Y. □ 


Lemma 4.1.2. IfY is a finite- dimensional subspace of a normed space X, then Y 
is located and = —Y. 

Proof. The case dim(F) = 0 is trivial, and the case dim(F) = 1 is dealt with in 
Lemma 4.1.1. Assume that all subspaces of dimension at most n ^ 1 in all normed 
spaces are located, and that their complements and metric complements coincide. 
Consider an (n + l)-dimensional space Y with basis vectors e±, . . . , e n+ \. Let Z be 
the n-dimensional space spanned by {ei, . . . , e n j . First note that for all Ai, . . . , A„ 

n 

in K, since 1 + ^ |A*| > 0, the linear independence of the vectors ei,...,e n +i 

i= 1 

n 

gives e„+i ^ YL A . Hence e ra +i belongs to and therefore, by our induction 
2—1 

hypothesis, to —Z; whence 


II ^n+l || x I Z ~ P ( e n+ 1) Z) > 0- 

It follows that Ke ra+ i is a 1-dimensional subspace of X/Z. Now, for each x £ X, 

p (x, Y) = inf {||x — te n +i — z\\ : t £ K, z £ Z} 

= inf {p (x — te n +i,Z) :tsK} 

= inf |||x - te n+ i\\ x/z : t £ k| , 

which exists, since, by Lemma 4.1.1, the 1-dimensional subspace Ke n +i is located 
in X/Z. Hence Y is located in X. 

Now consider an element x of ~Y. For all t, £ K and z £ Z we have x ^ te n +i + z 
and therefore x — te n + 1 ^ z. Thus x — te n + 1 £ ~Z, and therefore, by our induction 
hypothesis, 
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\\x - te n+ i\\ x / z = p{x - te n+1 ,Z) > 0. 

This shows that x yf x /z Ae n+ 1 ; whence x is in the complement of Ke n+ i relative 
to the quotient norm on X/Z. Again applying our induction hypothesis, this time 
to the 1-dimensional subspace Ke n +i of X/Z , and denoting by p x /z the distance 
corresponding to the quotient norm on X/Z , we see that 


0 < Px/z M Ke n+ i) = p (x, Y ) , 

so x £ —Y. Hence C —Y and therefore = —Y. This completes the induction 
step. □ 


Proposition 4.1.3. Let {ei, . . . , e n j be a basis for an n-dimensional normed space 
X. Then the corresponding coordinate functionals are bounded linear functionals. 

Proof. In the case n = 1, the sole coordinate functional is the linear mapping Aei -w 
A, which, being an isometry (that is, distance preserving), is trivially continuous. 

n 

Now consider the case n ^ 2. For each x £ X write x = y~) In order to prove 

i=l 

that the coordinate functional Uk is bounded, we may relabel, if necessary, to take 
k = n. Let Z be the ( n — l)-dimensional subspace of X with basis {ei , . . . , e n _i} . 
Since, as in the proof of Lemma 4.1.2, e n £ ~Z, it follows from that lemma that 
p(e n ,Z) > 0 and hence that Ke„ is a 1-dimensional subspace of X/Z. Moreover, 
for each x £ X we have 


I x/z = P Ui(x)ei, Zj = p (u n (x)e n , Z) 
= \\u n {x)e n \\ x i z = |w rt (a;)| ||e n || x / z 


and therefore 


\u n (x)\ = 


1 


Mix /z ^ 


1 


£ . 


W^nWX/Z W^nWx/Z 

Thus 1/ 1 1 e n 1 1 X j Z is a bound for the linear functional u n . 


□ 


Corollary 4.1.4. Every linear mapping from a finite- dimensional normed space 
into a normed space is bounded. 

Proof. Let u be a linear mapping of a finite-dimensional normed space X into a 
normed space Y. We may assume that dim(A) > 0. Let {ei, . . . , e n } be a basis of X , 
and, using Proposition 4.1.3, compute a common bound c > 0 for the corresponding 
coordinate functionals iq (1 ^ i ^ n). For each x £ X we have 
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Nz)ll = 


y ^Uj(x)ei 


vt=l 


y ^Uj(x)u(ej) 


i= 1 


< 


||u i (ar)u(e i )|| = M*)! II u ( e i) 


i = 1 


i- 1 


< ( c 5Z ii u ( e i)ii ) imi • 


i= 1 


So u is a bounded linear map. 


□ 


Corollary 4.1.5. Let {e\, . . . ,e n } be a basis for an n-dimensional normed space 
X. Then the canonical mapping 


(Ai, . . . , X n ) ^ Xjej (4-2) 

»= l 

is a bounded linear injection of the Euclidean space K™ onto X , and its inverse is 
a bounded linear injection. 

Proof. It is routine to verify the linearity of this map and its inverse, and that 

the maps are injective. The continuity of the mappings follows immediately from 

Corollary 4.1.4. □ 


Proposition 4.1.6. A finite- dimensional normed space is locally compact. 

Proof. The O-dimensional case is trivial. Consider a finite-dimensional normed space 
X with a basis {ei, . . . , e n }. Let u : K n — > X be the bounded linear mapping de- 
fined at (4.2). By Corollary 4.1.5, a -1 is a bounded linear map. Hence for any 
bounded subset B of X , u~ 1 (B) is a bounded subset of the locally compact space 
K" and is therefore contained in a compact subset K of K™. Since u is injective and 
its inverse mapping is uniformly continuous, it is strongly injective. It follows from 
Proposition 2.2.16 that u(K), which clearly contains B , is a compact subset of X. □ 


Corollary 4.1.7. The unit ball of a finite-dimensional normed space is compact. 

Proof. Let X be a finite-dimensional normed space, and B its unit ball. By Propo- 
sition 4.1.6, there exists a compact subset K of X that contains B. Now, for each 
x G X, 

p(x, B) = max{0, 1 — ||x||} 

exists, so B is totally bounded (by Proposition 2.2.10). Being also closed in K, it 
is complete and therefore compact. □ 
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Corollary 4.1.8. Every linear mapping from a finite- dimensional normed space 
into a normed space is normed. 

Proof. Let u be a linear mapping from a finite-dimensional normed space X into 
a normed space Y. We may assume that dim(X) ^ 1. By Corollary 4.1.4, u is a 
bounded linear mapping and hence is uniformly continuous on X. In particular, 
this implies that u maps the compact (by Corollary 4.1.7) unit ball of X onto a 
totally bounded subset of Y; so 

IMI = sup {||u (a;) || :xGX, ||x|| < 1} 

exists. □ 

Two norms || || , || on a vector space X are said to be equivalent if both the 
identity mapping from {X, || ||) onto ( X , || || ) and its inverse are continuous; since 
those mappings are linear, it follows from Proposition 2.3.3 that || || and || are 
equivalent norms on X if and only if there exist positive constants a, b such that 
a ||x|| < ||x|| , < b ||x|| for all x € X. 

Corollary 4.1.9. Any two norms on a finite- dimensional space are equivalent. 

Proof. If || || and || are two norms on a finite-dimensional space X , then, by 
Corollary 4.1.4, both the identity mapping from (X, || ||) to (X, || jf) and its in- 
verse are bounded, and hence continuous, linear mappings. □ 

We want to prove the converse of Corollary 4.1.7. This requires three lemmas, 
the second of which will have several applications later in the book. 

Lemma 4.1.10. Let X be a normed space, Y an n-dimensional subspace of X with 
basis {ei, . . . , e n }, and e a vector such that p(e,Y) > 0. Then the linear span of 
7U{e) is (n + 1)- dimensional, with basis {e\, ... ,e n ,e}. 

Proof. We need only prove that the vectors e\, . . . ,e n ,e are linearly independent. 

n+1 

To this end, consider elements A*(l^i<n+l)ofK. such that ^ M >0- Either 

i= 1 

n 

A n +i ^ 0 or ^2 | Ai| > 0. In the first case, 

i= 1 

n 

A jej + A n +ie ^ | A n +i | p (e, Y) > 0. (4-3) 

i= 1 

n 

In the second case, since e \, . . . , e n are linearly independent, ^ i e i > 0- Hence 
either ||A„+ie|| > 0, so that A n+ i ^ 0 and we have (4.3); or else ||A„+ie|| < 

n 

^2 A and therefore 

i= 1 
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n 

A iei + A ra _|_ie 

i - 1 




E 

i— 1 


A 


||An+ie|| > 0. 


Thus in all cases we have Aje* + A„+ie ^0. □ 

i= 1 

Lemma 4.1.11. Lei S' &e i/ie span of a finitely enumerable set {x\, . . . ,x n } in a 
normed space X, and let £ > 0. Then there exists a finite- dimensional subspace Y 
of S such that p(xi,Y) < e for each i. 

Proof. Setting Xo = {0} , suppose that for some k < n we have constructed finite- 
dimensional subspaces X 0 C X 1 C • • • C X k C S such that p(x i: X k ) < e for 
1 ^ i ^ k. Then either p(x k + i,X k ) < s, in which case we set X k+ \ = X k , or else 
p(x k + i,X k ) > 0. In the latter case we take X k +\ to be the span of X k U {xfc+i} , 
which is finite-dimensional by the preceding lemma. This completes the inductive 
construction of the finite-dimensional subspace X k +\. It remains to take Y = X n . □ 


Lemma 4.1.12. (Riesz’s lemma) Let Y be a closed located subspace with an 
inhabited metric complement in a normed space X, and let 0 < 9 < 1. Then there 
exists a unit vector x £ X such that ||ar — j/|| > 8 for each y £ Y. 

Proof. Fix xq £ —Y. Then 

0 < r = p(xo, Y) < 9~ 1 r. 


Choosing y 0 £Y such that 


let 


r < ||*o - 2 / 0 1| < 8 V, 

E = 77 h(*0 - 2/0 )• 

ll^o - 2/o II 


Then ||x|| = 1. Also, for each y £ Y, 

2/o + | |a;o - 2/o 1 1 V € Y. 

Hence 

ll^o - 2/o 1 1 II* - 2/11 = ll*o - (2/o + ||*o - 2/o | 
and therefore 


> p(xo,Y) = r 


I 7! — 7/11 > 


> 0. 


11*0 - 2/0 1 | 


□ 
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Proposition 4.1.13. A locally totally bounded normed space is finite- dimensional. 

Proof. Let B be the unit ball of a locally totally bounded normed space X. Then 
B is located in X. By Proposition 2.2.18, B is locally totally bounded; being 
bounded, it is therefore totally bounded. Let {x\, . . . ,x n } be a 1/4-approximation 
to B , and, using Lemma 4.1.11, construct a finite-dimensional subspace Y of X 
such that p(xi,Y) < 1/4 for each i. Let £ be any point of X , and suppose that 
p (£, Y) > 0. Then, by Riesz’s lemma, there exists a unit vector x £ X such that 
|| x — y\\ > 1/2 for all y £ Y. But this is absurd: for since x £ B, there exists i such 
that || cc — x»|| < 1/4 and therefore p(x,Y) < 1/2. We conclude that p(£, Y) = 0; 
so £ is in the closure of Y. But Y, being finite-dimensional and therefore locally 
compact (Proposition 4.1.6), is closed in X. Hence £ £ Y. Since £ £ X is arbitrary, 
we see that X = Y. □ 


Corollary 4.1.14. A located subspace of a finite- dimensional normed space is 
finite- dimensional. 

Proof. Let Y be a located subspace of a finite-dimensional normed space X. Since, 
by Proposition 4.1.6, X is locally compact, we see from Proposition 2.2.18 that Y 
is locally totally bounded. Hence, by Proposition 4.1.13, Y is finite-dimensional. □ 


By a convex combination of finitely many elements Xi, ... ,x n of X we mean 

n n 

a point of the form A iXi where each A ^ 0 and A,; = 1. The convex hull 

i— 1 i = 1 

of a subset S of X is the closure of the set of all points of X that are convex 
combinations of points of S. It is straightforward to show that the convex hull of S 
is the intersection of all convex subsets of X that contain S. 

According to Exercise 11 of Chapter 2, a located subset of M may not have its 
metric complement located. However, things are different when the set is convex. 

Proposition 4.1.15. If S is a located convex subset of the product normed space 
K" such that —S is inhabited, then —S is located. 

Proof. Consider any x in R", and for each r > 0 let B\{x,r) be the closed ball 
with centre x and radius r relative to the product metric p. Then B\{x,r) is an 
n-dimensional cube with centre x and sides of length 2 r. Consider any two real 
numbers a, /3 such that a < (3. Let v \, . . . , be the vertices of B\ (x, |(a + /?)). 
It is left as an exercise to show that there exists 5 > 0 such that for all points 
Wi , . . . , u> 2 « with 

p(vi,Wi ) <6 (1 < i < 2”) , 

the convex hull of {wi, . . . ,W 2 *} contains B i(x,o). Either p(vi, S) < S for all i, 
or p(v i, S) > 0 for some i. In the first case, for each i in {1, . . . , 2"} choose Wi in 
S such that p(vi,Wi) < 5; then S contains the convex hull of {wi , . . . , and 
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therefore contains B i(x,a). Thus p{x,y) ^ a for all y in —S. On the other hand, 
if for some i we have p(vi, S) > 0, then u* £ —S and p(x, Vi) 0 \{a + 0) < (3. Thus 
p(x, S) exists, by the constructive least-upper-bound principle. □ 


Although we do not develop integration theory in this book, it is worth pointing 
out here that for convex subsets of R" there is a close link between locatedness and 
Lebesgue measurability: a convex subset of R" with inhabited interior is located 
if and only if it is Lebesgue measurable [17]. Informally, this result shows that a 
convex subset of R" with inhabited interior can be located if and only if its size can 
be calculated. 


4.2 Best Approximation 

Let V be an inhabited subspace of a metric space X, and let a, b be elements of 
X, V respectively. We say that b is a best approximation, or closest point, to a in V 
if p{a,b) ^ p{a,v ) for each v £ V. In that case, p(a,V) exists and equals p(a,b). 
We call V proximinal in X if each element of X has a best approximation in V , in 
which case V is located. 

The fundamental theorem of classical approximation theory says that a finite- 
dimensional subspace of a normed space is proximinal. Constructively, this theorem 
implies LLPO (see Exercise 1). However, by introducing the idea of at most one 
object even without knowing in advance that there exists one, we can produce a 
good constructive version of the fundamental theorem. 

Let X, V, and a be as in the first paragraph of this section. We say that a has 
at most one best approximation in V if for all distinct points v, v' in V, there exists 
x £ V such that 

max {p(a, v), p(a, ?/)} > p(a,x). 

We call V quasiproximinal if each point of X with at most one best approxima- 
tion in V actually has a (perforce strongly unique) best approximation in V. Our 
destination in this section is the following constructive fundamental theorem of 
approximation theory. 

Theorem 4.2.1. Every finite- dimensional subspace of a real normed space is qua- 
siproximinal. 

We defer the proof until we have prepared the pathway with some preliminary 
results, one of which has a proof that uses the A-technique discussed in Chapter 3. 

Lemma 4.2.2. Let {ei, . . . , e n } be a basis for an n-dimensional subspace V of 
a normed space X, let 1 0 m < n, and let W be the subspace of X with basis 
{ei, . . . , e m } . Then the span of {e m +i, . . . , e n } is an (n — m)- dimensional subspace 
of the quotient space X/W. 
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Proof. The proof is relegated to Exercise 4. 


□ 


Lemma 4.2.3. Let x,e be elements of a real normed space X with e ^ 0, and for 
each S > p(x, Re) write 


Sg = {t € R : ||® — te\\ ^ 5} . 

If Ss is compact, then it is a proper compact interval [ m,M ] in R. Moreover, 

\\x — me\\ = 5 = \\x — Me\\ . (4.4) 

Proof. Suppose that Sg is compact, with infimum m and supremum M. By Corol- 
lary 2.2.14, there exists S' such that p(®,Me) < S' < 5 and Sg> is compact. Let 
m' and M' denote the infimum and supremum, respectively, of Sg’. The uniform 
continuity of the map t \\x — te || on R ensures that (4.4) and 

II® — m'e || = S' = ||® — M'e || 

hold. Since Sg> C Sg and S' < S, it follows that m < ml ^ M' < M and therefore 
m < M. Now consider any t G [m, M] . Writing 


we have t = (1 — r) m + tM. Hence 

||® — te|| ^ (1 — r) ||® — me|| + r ||® — Me || = (1 — r) 5 + tS = 5 
and so t G Sg. Thus Sg = [m, M] . □ 


Lemma 4.2.4. Let x,e be elements of a real normed space X with e / 0, and let 
d ^ 0. Suppose that 

max {||® — ie|| , ||® — t'e\\} > d 

whenever t, t' are distinct real numbers. Then there exists r G M such that 
||® — re|| > d entails p(®,Me) > d. 

Proof. Without loss of generality, we may assume that p(®,Me) < d+ 1. With Sg 
as in Lemma 4.2.3, we construct an increasing binary sequence (A„) ra ^i, sequences 
(ajn^ii of real numbers, and a decreasing sequence (<An) n>1 of positive 

numbers with the following properties: 

> S$ n is the proper compact interval [a ni b n \ ; 

> if A„ = 0, then p (®, Me) < 6 n < d + 1/n, and if n ^ 2, 

2 

0 bn tin h- — {b n — i a n —\) ; 


(4.5) 
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> if A„ = 1, then p (x, Re) > d, a n = a n - 1 , b n = b n - 1 , and <5 n = 

To begin the construction, set Ai = 0 and, using Corollary 2.2.14, choose <5i > 0 
such that p (x, Re) < c>i < d+ 1 and Sg 1 is compact. By Lemma 4.2.3, S$ 1 is a proper 
compact interval [ 01 , 61 ]. Now suppose we have constructed A„_i, a n _i, b n ~i, and 
6 n - 1 with the applicable properties. If A„_i = 1, set 


X n — 1, Q n &n— l) b n — b n — i, and S n — bn— i- 


If A„_i = 0, write 


r = max < x - -a„_i + -b n -x e , x - -a„_i + - 6 „_i e 


By our hypotheses, r > d\ so either p (x, Me) > d or else p (x, Me) < min {r, d + 1/n} . 
In the first case we define our numbers as at (4.6). In the second case we set A„ = 0 
and choose S n such that 


p (x, Me) < S n < min l S n -\, r,d -\ — 

I n 


and Ss n is compact; then by Lemma 4.2.3, S$ n is a proper compact interval [a n , b n 
contained in Ss n _ 1 . Since r > 6 n , either 


x ~ (\ a n-i + ^b n - 1) e > 5 n 


X - 1 + ^n-1^) e > 6 n . 

Taking, for example, the first case, and using the convexity of S$ n , we see that 
either 

[ 1 . 2 A 

CZ u n _i, —(x n — i T —b n —\ 1 


/ 1 . 2 

6 n C [ —CL n — i T —b n —i., b n — i 

Hence (4.5) holds. This completes our inductive construction. 
We see from (4.5) that for n ^ 2, if A n = 0, then 


0 < b n — a n ^ ( - J (bi — a±) 


and therefore 


0 ^ a n+ 1 - a n < ( - J ( 6 i - ai) . 
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Clearly, this last inequality also holds if A„ = 1. It follows that {a n ) n>1 is a Cauchy 
sequence and therefore converges to a limit r in R. Supposing that || a: — re || > d , 
compute a positive integer N such that ||a; — a^e || > d + 1/N. If X N = 0, then 

\\x - a N e\\ = 5 N < d+ 

a contradiction. We conclude that A at = 1 and hence that p(x,Re) > d. □ 


We now have the proof of Theorem 4.2.1. 

Proof. Let V be a finite-dimensional subspace of a normed space X over R, and let 
a be a point of X with at most one best approximation in V. If V has dimension 
0, then there is nothing to prove. If dim(C) = 1, then V = Re for some e ^ 0 in 
V, and we can apply Lemma 4.2.4 with x = a and d = p{a 1 V) to construct r £ R 
such that || a — re|| = p (a, V). 

Now let n be a positive integer, and suppose we have proved the desired result 
for all n-dimensional subspaces of real normed spaces. Consider the case where V 
has a basis {ei, . . . , e n +i} , and let Y = Re„+i. By Lemma 4.2.2, V/Y is an n- 
dimensional subspace of X/Y with basis {ei, . . . , e„}; moreover, for each x £ X we 
have 

p{x,V) = inf j||x- v\\ x/Y : v G . (4.8) 

Next, note that 

(*) for each v £ V there exists a G R such that if ||a — v — ae„ + i|| > p (a, V) , then 
ll a ~ v \\x/y = P (a - v, Re n+ i) > p (a, V) . 

For if t, t' are distinct real numbers, then 

|| (v + te n+ 1 ) - (v + t , e„+i)|| = 1 1- t'\ ||e n+ i|| ± 0 

and therefore, by our hypotheses, 

max{||a - v - te n+ i \\ , ||a - v - t' e n+1 \\} > p{a,V). 

So we can apply Lemma 4.2.4 with x = a — v, e = e n+ \, and d = p(a,V) to 
compute the desired a. 

With v. a as above, now let v' be a point of V distinct from v, and compute 
a' G R such that if ||a — v' — a'e n+ \^ > p (a, V ) , then 

ll a “ v '\\x/y = P(. a - v ' i Re«+i) > p{a,V) . 


We have 
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|| (v + ae n+ i) - (v' + a'e n+ i)\\ = ||(u - v') + (a - a') e n+ i\\ 

^ II' 1 ’ — v II X/Y > 

so, by our hypotheses, 

max {||a — v — ae n+ i|| , || a-v' - a’ e n+1 \\] > p(a, V) . 

It follows from (4.8) and the defining properties of a, a’ that 

max j||a-v|| x/ Y- ,\\a ~ v’\\ x/Y } > p(a,V ) = inf j||a - v\\ x/Y : v G . 

We have now shown that a has at most one best approximation in the n-dimensional 
subspace V/Y of X/Y. By our induction hypothesis and (4.8), there exists vq € V 
such that 

p(a - Vo, Ke„_|_i) = ||o - v 0 \\ x/Y = inf |||a - v\\ x/Y : v G v| = p (a, V) . 
Applying (*) once more, we compute to GM. such that 

||a - u 0 - fo e n+ill = p(a,V) . 

Hence vq + toe n +i is th e b es t approximation to a with respect to the original norm 
on X , and our inductive proof is complete. □ 


Theorem 4.2.1 is classically equivalent to its classical counterpart. For if V is a 
finite-dimensional subspace of the real normed space X and we suppose that some 
a G X has no best approximation in V, then a clearly has at most one best approx- 
imation in X. Therefore, by Theorem 4.2.1, it has one, which is a contradiction. 

For applications of Theorem 4.2.1, see [15]. 


4.3 Hilbert Spaces 


As we did for metric and normed spaces, we assume that the reader is familiar with 
the definitions and elementary properties of an inner product and an inner product 
space. In particular, we are not going to prove that 

Nl = \/{x,x) 

defines the norm associated with a given inner product on a vector space X over 
K, and that the Cauchy -Schwarz inequality 

\(x,y)\ < INI IMI 


and the parallelogram identity 
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\\x + y\\ 2 +\\x-y\\ 2 = 2\\xf + 2\\y\\ 2 

then hold. The latter inequality enables us to prove that an inner product space X 
is uniformly convex, as follows: If 0 < S < 1, and x,y are unit vectors in X with 
\ ||x + y\\ > 1 — 5, then 

Ik - yf = 2 INI 2 + 2 \\yf ||x + y\\ 2 < 4 - 4 (1 - <5) 2 , 

which can be made as small as we please, independently of x and y, by a suitable 
initial choice of 6. 

The simplest example of an inner product space is, of course, the Euclidean 
space K" with the usual inner product given by 

n 

{x, y) = Y1 x *y* > 

i= i 

where x = (xi, . . . , x n ) and y = (t/i, . . . , y n ) . Another example is the space k(K), 
consisting of all sequences x = (x n ) n>1 in K that are square summable in the sense 
00 2 

that ^2 \x n | converges in R; in this case we work with termwise operations and 

n = 1 

with the inner product defined by 

OO 

( x ,y) = ^2x n y* n . 

n = 1 

It is left to the exercises to prove that this is indeed an inner product. 

An inner product space that is complete with respect to its norm is called a 
Hilbert space. Adding completeness to the inner product leads to a structure with 
powerful geometrical properties such as the following. 

Theorem 4.3.1. Let S be a closed, located subspace of a Hilbert, space H. Then for 
each x £ H, there exists a strongly unique element Px of S such that ||x ~ Px\\ = 
p(x, S). Moreover, Px is the strongly unique element y of S such that (x — y,s) = 0 
for all s £ S. 

Proof. Fixing x £ H, let d = p (x, S) and choose a sequence (s„) n>1 in S such that 
d = linin^oo |k — s n ||. Using the parallelogram identity and the convexity of S, we 
have 

| km - Sn|| 2 = || km ~ x) - (s n - x)|| 2 

= 2 pm - X\\ 2 + 2 ||Sn - X|| 2 - 4 

< 2 ||sm - x|| 2 + 2 ||sn - x|| 2 - 4 d 2 
= 2 (ikm - x\\ 2 - d 2 ) + 2 (j|s„ - x|| 2 - d 2 ) 

— > 0 as m,n 



OO. 
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Hence (s„) n>1 is a Cauchy sequence. Since H is complete and S is closed, this 
sequence converges to a limit Px £ S. The continuity of the norm on H now gives 
||x — Px || = d. Moreover, if y £ S and y ^ Px, then, again by the parallelogram 
identity and convexity, we have 

0 < \\Px - y\\ 2 = || {Px -x)-(y- x)\\ 2 

= 2 II -Pat - at 1 1 2 + 2 || y- x\\ 2 - 4 

< 2 (||Pat - at|| 2 - d 2 ) + 2 (||y - at|| 2 - d 2 ) 

= 2 (||j/ — 2 t|| 2 — c? 2 ) , 

so 1 1 at — 2 /|| > d. It follows that Px is the strongly unique closest point to x in S. 
Next note that for all y £ S and A £ I\, 

(x — Px + A y, x — Px + A y) ^ d 2 = (x — Px, x — Px) 


Px + y 


— x 


and therefore 

|A| 2 |M| 2 + 2 Re (A* (x - Px, y)) > 0. (4.9) 

Suppose that Re ( x — Px,y ) ^ 0. By choosing a sufficiently small real number A 
with A Re (x — Px, y) < 0, we can contradict (4.9). It follows that Re ( x — Px, y) = 
0. A similar argument shows that Im (at — Px,y) = 0; whence (a % , — Px,y) = 0. 
Finally, if y £ S and y ^ Px, then since y — Px £ S, 

0 < \\y — -Pat|| 2 = (y - Px, y - Px) 

= (x- Px, y - Px) - (x - y, y - Px) = - (x - y, y - Px ) , 

so (x — y,y — Px) ^ 0. It follows that Px is the strongly unique element y of S 
such that (x — y, s) = 0 for all s £ S. □ 


The mapping P : H — > S defined in Theorem 4.3.1 is called the projection of 
H onto S, and for each x £ H, the vector Px is the projection of the vector x onto 
S. 

The mapping P is linear: for if A £ IK and x, x' £ H, then since for all y £ S, 

(Xx + x' — (A Px + Px'),y) = X(x — Px, y) + (x 1 — Px', y) = 0, 

it follows from the uniqueness of the projection of Aa; + x 1 onto S (Theorem 4.3.1) 
that P (Xx + x') = X Px + Px' . For each y £ S, since (y — y,s) =0 for all s £ S, we 
see from the uniqueness of the projection of y into S that Py = y, whence P maps 
H onto S. Since Px £ S, we have P 2 x = P (Px) = Px, so P is idempotent: that is, 
P 2 = P. Also, 
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||a;|| 2 = (x, x) = (x — Px + Px, x — Px + Px) 

= (x — Px, x — Px) + (x — Px, Px) + {Px, Px) 

= ||a;--Pa;|| 2 + ||Pa:|| 2 , 

since (x — Px, Px) = 0 (by Theorem 4.3.1). Hence ||Px| < ||x|| , and so 1 is a bound 
for P. On the other hand, if S contains a nonzero vector x, then ||Px|| = ||x|| ; it 
follows that in this case, P is normed and ||P|| = 1. 

Two subsets S, T of an inner product space are said to be orthogonal if (x, y) = 0 
for all x £ S and y £ T; we then write S 1 T. The orthogonality relation is 
symmetric, 

sirens, 

and any family (Tj) ig/ of subsets of A satisfies 

S X IJP) •<=>• Vi e 7 (S' iiTj) . 

iei 

A vector x is orthogonal to the subset S if {x} _L S, in which case we write liS; 
two vectors x, y are said to be orthogonal vectors if if {y}, in which case we write 
x X y and we have the following generalisation of Pythagoras’s theorem: 

\\x + y\\ 2 = \\x\\ 2 + \\ y f. 

We define the orthogonal complement of a subset S' of A to be the set 

S ± = {x £ A : x 1 S} , 


which is easily seen to be a closed linear subspace of A. 

If 0 € S, then 5 0 = {0} . It readily follows that if S is a closed, located 

subspace of a Hilbert space H, with P the corresponding projection, and if x £ H, 
then the decomposition 

x = Px + (x — Px) 

is the unique expression of x as the sum of a vector in S and a vector orthogonal 
to S. Denoting by I the identity operator x x on H, we prove that I — P is the 
projection of H onto the subspace S “ L . For each y £ S -1 , the vector z = x — Px — y 
belongs to S -1 , so 

II * - yf = II Px + *|| 2 = || P *|| 2 + INI 2 > || Pz || 2 = ||* - Or - Px)\\ 2 . 


Since x — Px £ £ lL , we conclude from the uniqueness part of Theorem 4.3.1 that 
x — Px is the projection of x into S' ± . 


We need at this point to clarify what we mean by saying that the series ^ Ajej 

iei 


converges to the sum x and by writing 


X — ^ ( X iCi , 


i&I 


(4.10) 
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where the index set / on the right is not necessarily countable: we mean that for 
each e > 0 there exists a finitely enumerable subset F of I with the property that 
if G is a finitely enumerable subset of I that contains F, then 

x — ^ A iCi < e. 
iSG 

In the case I = N + , this condition is equivalent to the usual one for convergence of 

OO 

the series ^ X n e n to x. 

n—1 

A family ( ei) ieI of vectors in a Hilbert space F[ is said to be orthonormal if 

t> ei _L ej whenever i ^ j , and 
> for each i, either 1 1 e* 1 1 = 1 or e.; = 0. 

Such a family is called an orthonormal basis if each vector x € Ft can be written 
uniquely in the form (4.10) with (A,) igJ a family of elements of K such that A j = 0 
whenever e, = 0. The scalar A j is then called the ith coordinate of x relative to the 
orthonormal basis. 

If (ef) ieI is a finitely enumerable orthonormal family in H, then it spans a finite- 
dimensional subspace of if. To see this, we may assume that 1 1 e* 1 1 = 1 for each i. 

Let (A i) ieI be a family of scalars such that |Aj| > 0, and let x = A je* . Choose 

ie/ iei 

j € I such that Xj ^ 0. Then 

(x, ej) = 'y ' A i (ej, ej) = A j (ej, ej) = Xj ^ 0, 
iei 

so x 0. Thus the vectors e* ( i € f) are linearly independent and therefore span a 
finite-dimensional space. 

Classically, using (an equivalent of) the axiom of choice, we can prove that every 
Hilbert space has an orthonormal basis of unit vectors. Constructively we avoid 
the axiom of choice by adding separability to the hypotheses on if, by relaxing 
the requirements to allow basis vectors to be 0, and by using the Gram-Schmidt 
orthogonalisation process embodied in the proof of our next result. 

Proposition 4.3.2. Every separable Flilbert space has a countable orthonormal 
basis. If ( e n ) n> i is such a basis, relative to which x has coordinates a\, ct 2 , ■ ■ ■ and 

OO 

■y has coordinates (3 1 ,f3 2 i ■ ■ ■ ■> then a n = (x, e n ) for each n, (x, y) = a nP* n , and 

n—1 

oo 

M 2 = E Kl • 

n=l 

Proof. Let (a n ) n>1 be a dense sequence in H. The idea of the proof is to construct 
inductively the orthonormal sequence (e n ) ra>1 to ensure that at stage n there exists 
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z n in the subspace S n of H spanned by {ei, . . . , e n j such that \\a n — z n || < 1/n. For 
convenience, set eo = 0 and So = {0}. Assume that e n has already been constructed. 
Being finite-dimensional, S n is closed and located, so the projection P n of H onto 
S n exists. Either ||a ra _|_i — P n a n + 1 || < 1/ (n + 1) or else a n + 1 / P n a n + ±. In the first 
case, set e n+ \ = 0. In the second, define a unit vector by 


e n + 1 — II jj II ( a n + 1 Pn a n+ 1) • 

H&n+l n Un+l|| 


Note that e n +\ is orthogonal to eo, . . . , e n since (by Theorem 4.3.1) a n + ± — P n a n +i 
is orthogonal to S n , and that 


&n + 1 — Pn&n+l T ||^n+l An^n+ll ^n+l 

is in the subspace of H spanned by {ei, . . . , e n +i}. This completes the induction. 

Now consider the vectors x, y € H. For each positive integer k we have unique 
representations 

k 

Pk% — ^ \ CK n 6 n 
n— 1 

and 

k 

PkU = ^ ^ f3 n €-m 

n— 1 

with a n = (3 n = 0 whenever e n = 0. For n ^ k, 
k 

Ot-n ^ ^ (C-ii €n) & n ) (*£? ^n) Pk^i & n ) (*^5 ^n) • 

i = 1 

Likewise, /3 n = (y, e n ) . Thus a n and (3 n do not depend on any k ^ n. Now, 

k 

( P k x,P k y ) = ^a n /3*. 

n— 1 

Since the sequence (a n ) n>1 is dense in X and p(ak,Sk) < 1/A:, we see that both 
p ( x , Sk ) and p ( y , Sk) approach 0 as k — > oo; whence PkX — > x and Pky — » y as 
k — > oo. Thus 

OO 

(x,y) = lim ( P k x,P k y ) = V^cW?*. 

k — > oo z — 

n— 1 

2 °° 2 

Finally, if we take y = x, we obtain ||a:|| = |a„| . □ 

n=l 

An elementary lemma will enable us to characterise dimensionality in terms of 
orthonormal bases. 
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Lemma 4.3.3. Let S be a finite- dimensional subspace of a separable Hilbert space 
H , and let (e ra ) n>1 be an orthonormal basis of H. Then there exists a positive integer 
N such that p (e n , S) > 0 whenever n ^ N and ||e n || = 1. 

Proof. By Corollary 4.1.7, there exists a finite 1/2-approximation {x\, . . . ,x m j to 

°° 2 

the unit ball of H. Since, by the preceding proposition, the series ^ |(£fc,e n )| 

n = 1 

converges, |(xfc, e n )\ — » 0 as n — > oo; so we can compute N such that 

3 

|(x fc ,e„)| < - (1 < k < to; n > N) . 

O 

Let P be the projection of H on the finite-dimensional, hence closed and located, 
subspace S. Consider any n ^ N such that ||e„|| = 1. Choosing k (1 ^ k < m) such 
that || Pe n — Xk | < 1/2, we have 

|| Xk || — 1 4“ || Xk || 2 Re {Xk , Cn) ^ 1 2 | (Xk , C n ) | ~ , 

SO 

P (^-n ? — W^n P&n || ^ || Xk || || Pc n Xk || ^0, 

as we required. □ 


Proposition 4.3.4. Let (e n ) ra>1 be an orthonormal basis of a separable Hilbert 
space H. Then H is finite- dimensional if and only if e n = 0 for all sufficiently large 

n. 

Proof. If H is finite-dimensional, then taking S = H in Lemma 4.3.3, we obtain 
N such that e n = 0 for all n ^ N. If, conversely, such N exists, then H is the 
finite-dimensional space of all linear combinations of those vectors ei, . . . , ejv that 
are nonzero. □ 


Let X be a linear space with an inequality compatible with its linear structure. 
We say that X is infinite- dimensional if the complement of each finite-dimensional 
subspace of X is inhabited. It follows from Bishop’s lemma (Proposition 3.1.1) that 
a normed space X is infinite-dimensional if and only if for each finite-dimensional 
subspace S of X there exists x € X such that p (x, S ) > 0. 

Proposition 4.3.5. Let (e n ) n>1 be an orthonormal basis of a separable Hilbert 
space H. Then H is infinite- dimensional if and only if e n 0 for infinitely many 

n. 

Proof. Suppose that H is infinite-dimensional, and consider any positive integer N. 
Let S be the finite-dimensional subspace of H consisting of all linear combinations 
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of e\ , . . . , ej\r, and let P be the projection of H onto S. There exists x £ H such 
that p(x, S) > 0 and therefore, by Proposition 4.3.2, 

OO 

X] \{x, e n }\ 2 = \\ x - Px\\ 2 = p(x,S) 2 > 0. 

n=N+l 

Hence there exists n > N such that (x, e n ) ^ 0 and therefore, by the Cauchy- 
Schwarz inequality, e n ^ 0. 

Conversely, if there exists a strictly increasing sequence (n^ j fc>1 of positive inte- 
gers such that e nk ^ 0 for each k, consider any finite-dimensional subspace S of H. 
Choose N as in Lemma 4.3.3. Then choose k such that Uk > N. Since ||e„ fc || = 1, 
we must have p (e„ fc , S) > 0. □ 


We now introduce a construction of new Hilbert spaces from old. Let Hi, H 2 be 
Hilbert spaces, and for all x = (aq, £ 2 ) and y = ( 2 / 1 , 2/2) hr the product vector space 
Hi x H 2 define 

(x,y) = (xi, yi) + (* 2 , 2 / 2 ) • 

This defines an inner product with respect to which Hi x H 2 is a Hilbert space, 
called the direct sum of Hi and H 2l and denoted by Hi ® H 2 . This construction 
helps us to prove the most general case of our next theorem. 

We say that a linear functional u on a Hilbert space H is represented by a vector 
a £ H if u{x) = (x, a) for all x € H. The following Riesz representation theorem 
tells us that a functional is representable by a vector in this way if and only if it is 
normed. 

Theorem 4.3.6. A bounded linear functional u on a Hilbert space H is normed if 
and only if there exists a £ H such that 

u(x) = (x, a) (x £ H) . (4-11) 

In that case, a is strongly unique: if y £ H and y ^ a, then there exists x £ H such 
that u(x) ^ ( x,y ) . Moreover, ||it|| = ||a|| . 

Proof. Suppose first that there exists a vector a with property (4.11). Then, by the 
Cauchy-Schwarz inequality, |u(*)| ^ ||a|| ||*|| for each x £ H. On the other hand, 
given e > 0, we have either ||a|| > 0 or ||a|| < e. In the first case, 

u (m°) = ^ > ^ ~ £ ' 

In the second, u(0) = 0 > ||a|| — e. Since £ > 0 is arbitrary, it follows that ||u|| exists 
and equals ||a|| . 
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Suppose, conversely, that u is normed. To begin with, take ||«|| > 0. Then ker(w) 
is located, by Proposition 2.3.6. Let P be the projection of H on ker(w), choose 
y £ H with u (y) >0, and define 


x 0 


1 

u(y) 


(y - Py ) • 


Then u(x o) = 1, so x — u{x ) xq € ker(u). Since xq _L ker(u), we have 


0 = (x - u(x) x 0 ,x 0 ) = (x,x 0 ) - u(x) ||x 0 || 2 , 


from which it follows that u(x) = (x, a) with 

1 


It remains to remove the condition that ||it|| > 0. In doing so, we need to bear in 
mind that we have not ruled out the possibility that H = {0}. To deal with this, 
we consider the direct sum H © K, on which we define a bounded linear functional 
v by 

v{x,Q = u{x) + C- 

We first observe that u is represented by the vector a if and only if v is represented 
by the vector (a, 1). Since ||u|| > 0, it follows from the first part of the proof that 
it will suffice to prove that v is normed. To this end, let 0 < a < f3 and set 
£ = \{a + ff). If ||«|| > 0, then, by the first part of the proof, u is represented by a 
unique vector a £ H; whence v is represented by (a, 1) and is therefore normed. So 
we may assume that ||u|| < e. Then either ||u|| + 1 > a + £ or ||u|| + 1 < [3. In the 
first case, u(0, 1) = 1 > a and ||(0, 1)|| = 1. In the second case, for each (x, £) with 
|| (a;, C) || Owe have 

\v{x, C)| < \u(x)\ + |C| < ||u|| + 1 < /3. 

It now follows from the least-upper-bound principle that ||i>|| exists. □ 


The second part of the foregoing proof contains an argument showing that the 
sum of two normed linear functionals on a Hilbert space is normed. Note that in 
general, the sum of two normed linear functionals on an arbitrary nontrivial normed 
space need not be normed. 

By an operator on a Hilbert space H we mean a linear mapping of H into itself. 
The set of bounded operators on H is denoted by B(H). 

Classically, the Riesz representation theorem enables us to prove, for a given 
element T of B(H ), the existence of the adjoint T*, which has the defining property 


(Tx,y) = (x,T*y) (x, y £ H) . 


(4.12) 
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Indeed, given y £ H, we apply the Riesz representation theorem to the linear 
functional x ( Tx,y ) to obtain a unique vector T*y with the desired property. 
The constructive problem with this argument is that in order to apply Theorem 
4.3.6, we require that the functional x (Tx, y) be normed, which is something we 
cannot guarantee. In fact, as the following Brouwerian example shows, we cannot 
prove that a general normed operator on H has an adjoint. 


Let H be a complex Hilbert space with an orthonormal basis (e n ) n>1 of unit 
vectors, and let (a„) n>1 be a binary sequence with at most one term equal to 1. 

OO 

Observe that for each x € H the series Y a n (x, e n ) converges absolutely. For, 

71=1 

°° 2 

given £ > 0 and choosing N such that Y \{x,e n )\ < £ 2 , for each k > N we 

71=iV+l 


have 


k 


k 


Y \a n {x,e n )\= Y a n |(ar, e n )\ < max |(x,e„)|<£. 

n=N-\-l n=N-\-l 


oo 

Hence the partial sums of Y a n \(x, e n ) \ form a Cauchy sequence, and so the series 

71=1 

OO 

Y « n (x, e n ) converges in C. Thus 

71=1 


Tx = 




. (x,e n ) 


ei 


defines a mapping — clearly an operator — from H to itself. It is left as an exercise 
to show that T is normed. Suppose that T* exists. Then either T*ei yf 0 or else 
||T*ei|| < 1. In the first case we can find N such that (T*ei, ejv) yf 0. Suppose that 
a N = 0. If there exists n yt N such that a n = 1, then for all x,y £ H, 


(Tx, y) = ((x, e n ) e x ,y) = (x, (y, ei) e n ) 


and therefore 

T*y=(y, ei )e n . (4.13) 

Hence (T*e i, e n) = 0, a contradiction. Thus a n = 0 for all n yf and therefore for 
all n; whence T* = 0, which is impossible since T*e i yf 0. We conclude that a^r = 1- 
In the case ||T*ei|| < 1, suppose that a n = 1. Then (4.13) holds, so T*e i = e n and 
therefore ||T*ei|| = 1, a contradiction. It follows that in this case we have a n = 0 
for all n. 

In view of this Brouwerian example, for any Hilbert space H and any not nec- 
essarily bounded operator T on H, we define the adjoint T * , if it exists, by the 
equation (4.12); in which case we refer to T as jointed. It is then straightforward to 
show that T* is an operator; that the adjoint of T* is T; and that any bound for T 
is one for T* , and vice versa. Moreover, if S, T are jointed operators, then for each 
A £ I, so are A S + T and ST, and 
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(A S + T)* = A *S* +T*, 

(ST)* = T*S*. 

For example, we have 

(STx, y) = (Tx, S*y) = (x, T* ( S*y )) = (x, T*S*y ) . 

An operator T on H is called selfadjoint , or Hermitian , if T* exists and equals T. 
The identity operator I is trivially selfadjoint. More generally, if P is the projection 
of H onto a closed, located subspace, then for all x,y € H we have 

( Px , y) = ( Px , Py) + ( Px , y — Py) = ( Px , Py) 

= {Px, Py) +{x- Px, Py) = (x, Py) , 


so P is selfadjoint. 

Conversely, if P is any bounded, idempotent, selfadjoint operator on H , then P 
is a projection. To see this, let 

V = {y € H : Py = y) . 

It is clear that His a linear subspace of H. If x £ H, then since P 2 x = Px, we have 
Px £ V. On the other hand, for each y £ V we have 

(x - Px, y) = (x, y) - (Px, y) = (x, y) - (x, Py) = 0, 

since Py = y. Hence x — Px is orthogonal to V, and therefore (since Px — y is in 

V) 

\\ x - y\\ 2 = \\ x - Px + Px- y\\ 2 

= ||ar - Px || 2 + || Px - 3 /|| 2 > ||x - Px || 2 . 

We now see that Px is a closest point to x in V. Hence V is located in H. Finally, 
the continuity of the bounded operator P ensures that V is closed; so P is the 
projection of H onto V. 

We end the chapter with an application of the Riesz representation theorem 
and a corollary, both due to Ishihara [58]. For this we need to know that a linear 
mapping T between normed spaces X, Y is defined to be compact if T(Bx( 0, 1)) is 
a totally bounded subset of Y ; in that case, the norm of T exists, by Corollary 2.2.7. 
Every bounded linear mapping on a finite-dimensional normed space is compact. 

Proposition 4.3.7. Let T be a bounded linear mapping of a Hilbert space H into 
C", and for 1 ^ i ^ n let Pi : C" — > C be the ith projection mapping, defined by 

Pi(z 1 , Z'2 , - - • , Zn) . 

Then T is compact if and only if Pi oT is normed for each i. 
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Proof. If T is compact, then since the projection Pi is uniformly continuous, the 
set Pi oT(5(0, 1)) is totally bounded, so \\Pi o T\\ exists. Suppose, conversely, that 
Pi o T is normed for each i. By the Riesz representation theorem, for each i there 
exists di £ H such that 

Pi oT(x) = (x, af) (x £ H) . 

Using Lemma 4.1.11, construct a finite-dimensional subspace X of H such that 
p(a,i, X) < e/2 n for each i. Let P be the projection of H onto X. Since the restriction 
of T to X is a compact linear mapping, there exist X\,X 2 , - ■ ■ , x m in the unit ball Bx 
of X such that {Tx\,Tx 2 , ■ ■ ■ ,Tx m } is an e/2-approximation to T{Bx)- Consider 
any x £ H with ||a:|| ^ 1. Working with the product norm on C n , we have 

n 

\\T(x-Px)\\=^2\PiO T (x-Px)\ 

2=1 

n 

= (x - Px, di) 

n 

= “ P ) a i) 

n n 

< 11*11 ll ai “ Pa ill < ^ = 

2=1 2=1 

Choosing j such that \\TPx — Txj\\ < s/2, we now obtain 

\\Tx - Txj\\ < \\T(x - Px ) || + || TPx - Txj\\ < | + | = e. 

Thus {Txi, Tx 2 , ■ ■ ■ , Txm) is an e-approximation to T(Bh). Since e > 0 is arbi- 
trary, we conclude that T is a compact linear mapping on H . □ 


Corollary 4.3.8. The sum of two compact operators on a Hilbert space is compact. 

Proof. Let S and T be compact operators on a Hilbert space H , and let e > 0. 
Since S(Bh) and T(Bh) are totally bounded, it readily follows from Lemma 4.1.11 
that there exists a finite-dimensional subspace X of iL, with associated projection 
P , such that 

\\(S + T)x-(PS + PT)x\\<e ( x&B h ). (4.14) 

We may assume that X is nontrivial and so has a basis {ei, . . . , e n j . Writing 

n n 

PSx = y ^Pi(x)ej, PTx = qj(x)ej, 

i=l i=l 

we see from Proposition 4.3.7 that for each i, the linear functionals Pi,qt on H are 
normed; whence pi + qi is normed (see the remark following the proof of the Riesz 
representation theorem). Since 
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(PS + PT) ( x ) = ^ (P* + ®) ( a ’) e i> 

i=l 

we see from Proposition 4.3.7 that PS + PT is compact. In view of (4.14), any 
e-approximation to (PS + PT) ( Bh ) is a 2e-approximation to (S + T)(Bh)- Hence 
S + T is compact. □ 


Exercises 

1. Let A be the space R 2 with norm ||(x, y)|| = max{|x| , |y|} . Show that if the 
1-dimensional subspace R (cos 9 , sin 9) is proximinal in X for each 9 £ R, then 
LLPO holds. 

2. Let V be a nonzero linear subspace of a normed linear space X such that each 
x £ X has at most one closest point in V. Need V be located? 

3. Prove that if every pair of nonzero vectors in R 2 generates a finite-dimensional 
space, then LPO holds. 

4. Prove Lemma 4.2.2. 

5. Prove that a uniformly convex linear subspace of a normed space is proximinal. 

6. A normed space X is said to be compactly generated if there exists a compact set 
K C X such that each point of A is a linear combination of finitely many points 
of K. Let A be a compactly generated Banach space, and Y a finite-dimensional 
subspace of A. Prove that either A = Y or else the metric complement of Y is 
inhabited. (Hint: Use the A-technique.) 

7. Let Y be a locally compact subspace of a complete metric space A such that 
each x € X has a unique best approximation Px in Y. Prove that the mapping 
P is sequentially continuous on A. (Hint: First use Ishihara’s tricks.) 

8. Let T be a bounded linear mapping of a normed space A onto a finite- 
dimensional Banach space Y. Prove that there exists r > 0 such that By( 0, r) C 
T(B X ( 0,1)). 

9. Let T be a bounded linear mapping of a normed space A onto a finite- 
dimensional Banach space Y. Prove that T is a compact linear mapping if and 
only if ker(T) is located in A. 

10. Let A be an inner product space. Prove that ||a;|| = \J (x, x) defines a norm 
on A' and that the Cauchy-Schwarz inequality \(x,y)\ < ||a;|| ||y|| holds for all 
x,y € A. 
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11. Let p ^ 1, and let l p be the set of all sequences x = {x n ) n>1 in the field K 

OO 

that are p-summable, in the sense that \ x n\ P converges. Equipping l p with 

n—1 

termwise operations of addition and multiplication- by-scalars, prove that 


= £ 


i/p 


defines a norm on l p , and that l p is complete with respect to this l p -norm. 

In the case p = 2, show that the Z p -norm arises from an inner product, as in 
Exercise 10. 

12. Let S, T be orthogonal linear subspaces of a Hilbert space H such that 

S + T = {x + y : x £ S, y £ T} 
is dense in H. Prove that S and T are both located. 


13. Let S' be a linear subset of a Hilbert space H such that for each x £ H there 
exists y £ S with x — y orthogonal to S. Prove that S is closed and located. 

14. Construct a Brouwerian counterexample to the statement that every selfadjoint 
operator on a Hilbert space H is normed. Does every normed operator on H 
have an adjoint? 

15. Construct a Brouwerian counterexample to the statement that every selfadjoint 
operator on a Hilbert space that has located kernel also has located range. 

16. Construct Brouwerian counterexamples to each of the following statements. 

(a) Every bounded linear functional on 1 2 (see Exercise 11) is compact. 

(b) If T is a bounded linear mapping of l\ into C 2 such that Pi o T is normed 
for i = 1,2, then T is compact (where Pi is the itli projection of C 2 onto 
C). 

17. Construct a Brouwerian counterexample to the proposition that the sum of two 
compact linear mappings between normed spaces is compact. 

18. Use the A-technique as an alternative means of removing the restriction that 
||u|| > 0 in the proof of the Riesz representation theorem. 

19. Let 1Z be an algebra of normed, jointed operators on a complex Hilbert space H 1 
where the operation of multiplication on 7 Z is just the composition of operators. 
For each T £lZ and each A £ C define an operator T' on the direct sum H © C 

by 

T'(x, C) = ( Tx + Xx, AC) ■ 

Show that T' is both selfadjoint and normed. 
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Notes 


Our definition of “linearly independent” is classically equivalent to the usual one: 

n 

namely, if ^2 Aj = 0, then A^ = 0 for each i. Constructively, the two definitions 

i — 1 

are equivalent if and only if MP holds. Bishop’s definition of “finite-dimensional” 
requires that the space have a norm from the outset. Our definition is more in the 
spirit of linear algebra. 

Theorem 4.2.1 can be applied to the case of Chebyshev approximation: best 
approximation of elements of C [0, 1], relative to the sup norm, by polynomials of 
degree at most n. However, in this special case a deeper analysis enables one to prove 
the existence of best approximations without the use of Theorem 4.2.1; moreover, 
that analysis reveals that the best approximation process is, as one would expect 
in a constructive context, continuous. See [15]. 

The induction step in the proof of Theorem 4.2.1 is a lot simpler if we are 
permitted to use Brouwer’s fan theorem; for we can then show that any uniformly 
continuous function from a compact metric space to the positive real numbers has 
a positive infimum (see [34] , Chapter 6) . 

As Section 4.3 shows, the elements of Hilbert space theory require very few 
modifications to bring them into constructive line. However, we have to be careful 
about the requirement that the linear functionals be normed before we can apply 
the R.iesz representation theorem, and about the possibility that a given bounded 
operator may not have an adjoint. 

For a different proof of the Riesz representation theorem see [35] . Working with 
a direct sum, as was done in the proof of Theorem 4.3.6 in order to circumvent 
our inability to decide whether a space is trivial, is a useful technique in other 
applications; see [28]. 

In the proof that an idempotent, bounded, selfadjoint operator P on a Hilbert 
space is a projection, we used boundedness only to prove that the set V = 
{x £ H : Px = x} is closed. The Hellinger-Toeplitz theorem (see Chapter 6) shows 
that every selfadjoint operator P on H is sequentially continuous, a property strong 
enough to prove that the corresponding set S is closed. Thus, in fact, every idem- 
potent selfadjoint operator on a Hilbert space is a (bounded linear) projection. 
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Linearity and Convexity 


Every separation is a link. 

— Simone Weil, ‘Metaxu’, Gravity and Grace 

We begin the chapter by exploring some geometric aspects of convexity that are used 
later in the construction of one of the cornerstones of functional analysis: the separation 
theorem. In turn, this leads us to the Hahn-Banach extension theorem, which in its most 
general form for separable normed spaces produces only approximately norm-preserving 
extensions of normed linear functionals. We also give Ishihara’s version of the Hahn- 
Banach theorem, which provides a unique norm-preserving extension of a given normed 
linear functional in the case where the norm on the space is Gateaux differentiable. We then 
use the separation and Hahn-Banach theorems to explore the interplay between a normed 
space and its dual. In particular, we characterise certain linear functionals on spaces of 
bounded linear mappings. For our discussion of duality we develop the fundamentals of 
the theory of locally convex topological vector spaces. 


5.1 Crossing Boundaries 

Suppose we start at a point £ in the interior of a located subset C of a normed 
space X and move linearly towards a point z in the metric complement of C. Are 
we able to tell when we are crossing the boundary 

dc = c n 

of C? In general, as is discussed in more detail in problems at the end of this chapter, 
the constructive answer is no. However, our geometric intuition suggests that when 
C is convex, we might succeed in pinpointing boundary crossing points. Our first 
few lemmas are designed to lead us to a proof of the existence of the boundary 
crossing point, which depends continuously on the origin £ and the terminus 2 of 
our path out of the convex set. 
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Lemma 5.1.1. Let C be a convex subset of a normed space X, let f £ C°, and let 
r > 0 be such that B (£, r) C C. Let z ^ £, 0 < t < 1, and z' = t£ + (1 — t ) z. If the 
ball B(z,tr) intersects C, then B(z',t 2 r ) C C. 


Proof. Suppose that there exists y in B(z,tr) fl C. Let Cf £ B(z',t 2 r) and 


Then 




f-r = 




< (^-i) \U-y\\ + JlK-C'l 


i 


t 


i 


< 1 ) tr H — t r 


t 


Hence £f G C. Since = tt; + (1 — t) y, it follows by convexity that G C. □ 


In the context of a vector space X over IK we define intervals as follows: 

[x, y\ = {tx + (1 - t) y : 0 < t < 1} , 

{x, y) = {tx + (1 - t) y : 0 < t < 1} , 


where x, y G X. 

Lemma 5.1.2. Let C be an open convex subset of a normed space X such that 
C U —C is dense in X , let (GC, and let z G —C. Then ( C U — C) fl [£, z\ is dense 
in [£, 2 ] . 

Proof. Fix e > 0 and choose r such that 

B(f, r) C C, and B(z, r) C —C. Given y in [£, z] , we may assume that ||y — £|| > r/2 
and || y — z\\ > r/2, so y = at; + (1 — a) z for some a G (0, 1) . Fix A such that 

— £ — Oi I 

K^r^l <A<0 - (5 - 1) 


and set 
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t = 


A 

A — 1 ’ 


yi = + (1 - A) y. 


Then 0 < t < 1 and y = tf + (1 — t) y-\ . Also, 

Vi = t 1 - (! - A)(l - a)] C + (1 - A) (1 - a) z, 

where, by (5.1), 


0 < (1 - A) (1 - a) < 1, 
so 1/1 G [£, z \ . Now pick y' £ C U — C such that 

\\y-y'\\ < min{r 2 ,t 2 r} . 

Take first the case y' G C. By our choice of r, the point £ + £ (y — y') belongs to C. 
Since C is convex, 


y" = + ytt (t + 1 (y ~ 2/')) 

also belongs to C. Moreover, 

y" = G K. v \ c & 2 ] - 

and 

lly - v"\\ = IIC - ylK 115 - z\\ < e. 

1 + r 1 + r 

We are left with the case y' € — C to dispose of. But then B(y,t 2 r ) intersects 
— C, so, by the preceding lemma, B(y 1 ,tr) fl C = 0; whence 2/1 G — C. Since 

\\y-yi\\ = l A l ll£ -2/11 < ^ 

by (5.1), the proof is complete. □ 


Lemma 5.1.3. Let X be a normed space, let X\,X 2 be distinct points of X, and 
let x$ = Aaii + (1 — A) X 2 with A ^ 0, 1. For all a, (3 > 0, if ||x — xi|| < a/ |A| and 
\\y - X 2 \\ < / /3/1 1 - A|, then || \x + (1 - A) y - cc 3 1| < a + 0. 

Proof. For such x and y we have 

1 1 As + (1 - A) 2 / - x 3 || ^ |A| \\x - a;i || + |1 - A| \\y - x 2 \\ , 


from which the result follows almost immediately. 


□ 
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In the presence of convexity, the preceding, seemingly innocent, lemma turns 
out to be a powerful tool. Here is a first example of its use. 

Lemma 5.1.4. If C is an inhabited, open, convex subset of a normed space X , then 
—C is dense in ~C. 

Proof. Fixing £ G C, choose r > 0 such that B (£, r) C C. Consider any 2 G 
and £ > 0. Setting 


and 

X 2 = tz + (1 - t) £, 

we see that || a; 2 — z\\ = £ and that 


■e 


-t 
i - 1 


z + 


1 


1 - 1 


X 2 . 


It now suffices to show that x 2 G —C. Taking X\ = z,x 3 = £, and A = t/(t — 1) in 
Lemma 5.1.3, we see that if ||y — x 2 \\ < r (t — 1), then 


—t 
1 - t 


z + 


1 

1 - 1 


y-t, 


< r 


and therefore 


If also y G C, then 


—t 
1 - t 


z + 


1 

l-t 


y&C. 


z 




-t 

l-t 



e C, 


a contradiction. Hence y C whenever \\y — x 2 \\ < r (t — 1); in other words, 
B(x 2 ,r(t — 1)) C -’C, from which it follows that x 2 G —C. □ 


A more significant example of the application of Lemma 5.1.3 is the following. 

Proposition 5.1.5. Let C be an open convex subset of a Banach space X such 
that C U —C is dense in X, and let f G C. For each z G —C and each t G [0, 1] 
write 

z t = tf + (1 - t) z. 

Then the following hold: 

(a) 7(£, z) = inf {t G [0, 1] : zt G C} exists, and 0 < y(£, z) < 1. 

(b) £ 7 (£ )2 ) is the unique intersection of [£, z] with dC. 



(c) If y(£, z) < t < 1, then z t € C. 

(d) If 0 ^ f < y(£, z ), then z t € —C. 
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Moreover, the mapping (£, z) Z i(£,z) °f C x — C TOto 9C is continuous at each 
point of C x —C. 

Proof. Fixing z in —C, let 


S — {t £ [0, 1] : Zt £ C 1 } , 

which contains 1. Note that since C is convex, S is convex also, from which it follows 
that 


(*) if 0 < t' < 1 and z t > ^ C, then z t ^ C for all t with 0 <t <tf. 

Let 0 < a < (3 < 1. By Lemma 5.1.2, there exists to G (a,/?) such that the point 
Vo = tof + (1 — to) ^ belongs to C U — C . Either to £ S or y 0 € —C D [£, z] . In the 
latter case, since £ € C n [^, 2 ] , we see from (*) that to, and hence a, is a lower 
bound for S. It follows from the least-upper-bound principle that 

m = y(£, z) = inf S 

exists. Choosing r > 0 such that B (£, r) C C and B (z, r) C — C, now observe that 

then 0 < t < 1 and ||£ — (tf + (1 — t) z)|| = r; whence tf + (1 — t) z € C. Thus 

r 


to < 1 — 


U~z\ 


< 1 . 


On the other hand, since 


\\ z ~ (1 - t) £ + Ml =r 

and therefore z t £ —C, we see from (*) that 

r 


to ^ 


U~z\ 


> 0 . 


By the definition of to as an infimum, the point 

h{z) = mf + (1 — to) 2 

belongs to the closure of C. Again applying Lemma 5.1.2, we can find t' £ (0 , to) 
such that z t i belongs to CU — C and is arbitrarily close to z rn . By the definition of 
to, we have z t / (f C, and so z t > £ —C. Thus z m is in the closure of —C and hence 
in dC. The uniqueness part of (b) will follow immediately once we have proved (c) 
and (d). 
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Now observe that if m < t ^ 1, then by the definition of m as an infimum, 
there exists t' such that m < t! < t and z t ' € C; since z t is in [£, z t >}, it is in the 
convex set C. This proves (c). To dispose of (d), let 0 < t < m and let y be the 
midpoint of the segment [£, z m \ . Then, by (c), y £ C, so there exists 6 > 0 such 
that B ( 77 , (5) C C. Also, 77 = A z m + (1 — A) z t for some A > 1. Consider any point 
■y with || y — z t || < <5 / (A — 1). Applying Lemma 5.1.3 with x\ = z m ,x 2 = Zt, and 
£3 = y, we see that 

II A z m + (1 - A) y - 77H < S 

and therefore A z m + (1 — A) y £ C. If also y £ C, then 

1 

which is absurd since z m £ dC and C is open. Thus 

b ( z, 'a"vi) C " c 

and therefore z t £ —C. This proves (d). 

To prove the continuity of the boundary crossing map on C x — C, fix £ £ C, 
z £ —C, and £ > 0. Using (c) and (d), choose a, 6 , s such that 0<a<m<6<l, 
s > 0, 

B(z a ,s) C -CnB(z m ,£) , 

and 

B (Zb, s) C C nB(z m ,£) . 

Consider points z' £ —C and £ £ C with 

max{||,' — ,||,]| 4 '-{||}<|min{l l.j-C.j-C}. 

For each t £ [0,1] set 

z' t = tg + (1 - t) z'. 

Taking x-y = £, £2 = z,x = (, , and y = z' , and applying Lemma 5.1.3 with A = a, we 
obtain ||^ — z a \\ < s and therefore z' a £ — CCiB(z m , e). On the other hand, applying 
Lemma 5.1.3 with A = b, we see that ||z£ — Zb\\ < s and therefore z' b £ CnB(z m ,e). 
It follows that x ) is in the segment (z' a ,z' b ), which lies in B(z m ,£ ); whence 
<e - D 


Zm — ^ (1 1/) 


For fixed ( £ C, we call the mapping z z 7 (£, z ) in the foregoing proposition 
the boundary crossing map of C relative to £. We shall use the existence of exact 
boundary crossings out of convex sets in the next section. 
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5.2 Separation Theorems 


Developing the theme of convexity, in this section we approach the fundamental 
theorems on the separation of points and convex sets by hyperplanes. In turn, this 
material will lead us in the next section to the Hahn-Banach theorem, one of the 
cornerstones of functional analysis. 

A subset C of a vector space X over IK is called a cone if for all x,y € C and 
all t > 0, both x + y and tx belong to C. In that case, C is convex. The closure of 
a cone is a cone, as is the intersection of two cones. 

If K is a convex subset of X, then the set 


c(K) = {tx : x £ K,t > 0} 


is a cone. For clearly, if x £ c(K), then tx £ c(K ) for all t > 0; whereas if 
xi,X 2 € c(K), then X\ = t\y\ and X 2 = ^22/2 for some y\ 1 y 2 € K and some 
ti,<2 > 0, so Xi + X 2 = (t i + t. 2 ) 2 with 


z = 


tl 

t\ + t'2 


Vl + 


^2 

tl + t2 


V2 e K. 


We call c ( K ) the cone generated by the convex set I\. If A is a normed space and 
I\ is open, then so is c(K). 


Lemma 5.2.1. Let I\ be a bounded located subset of a normed space X, a £ X, 
and let t > 0. Then the set 

S = {tx + (1 — t) a : x £ K , 0 < t < r} 

is located. 


Proof. Choose R > 0 such that ||x|| ^ R for all x £ K. Fixing xq £ X, note first 
that for each t > 0, 

fit) = p (x 0 , {tx + (1 - t) a : x £ K}) 


exists and equals 


, ' 1 t - 1 . 

tp ( -x 0 H —a, K ) . 


If also t' > 0, then since 

||z 0 -tx-{l-t) a|| < ||x 0 - t'x - (1 - t') a|| + (||a|| + ||x||) 1 1 - t'\ 
for all x £ X, we see that 

It follows that the mapping / is uniformly continuous on R + ; whence p(xo,S) ex- 
ists as the infimum of / on the totally bounded interval (0, r) . □ 
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Lemma 5.2.2. Let K be a bounded, located, convex subset of a normed space X 
such that p{ 0, K) > 0, and let a € X be such that p(—a, c(K)) > 0. Then there 
exists r > 0 such that || tx + (1 — t) a|| ^ r for all x € K and t > 0. 

Proof. Let 


<5 = 


p(0,I<) 


3 (IMI + p(0, K)) ' 

Given x in K and t > 0, we have either |1 — t\ > 5 or |1 — t\ < 26. In the first case, 


|| tx + (! — £) a|| 


|1 -*l 


1 - t 


■ + (-a) 


> 5p{-a,c{K)). 


In the case |1 — t\ <26, 

\\tx+ (1 — i) a|| >t\\x\\ -\l-t\ ||a|| 

>(l-26)p(0,K)-25\\a\\ 

= \p{ 0,K). 

Setting 

r = min c(K)), ^p(0, A')| > 0, 

we see that || tx + (1 — t) a|| ^ r in either case. 


□ 


Lemma 5.2.3. Let K be a bounded, located, convex subset of a normed space X 
such that p(0,K) > 0. Then c(K) is located. 

Proof. Fix xo € X. For each t > 0 and each x £ K we have 

Iko - toll > |£| ||a;|| - Moll • 


Hence 


p ( xo , tK) ^ t.p (0, K) — Moll — > oo as t — > oo. 
Compute r > 1 such that 

p(x 0 ,tK) > p(x 0 , K) (t>r- 1) . 


(5.2) 


Then 

d = inf {p Mo, tK) : 0 < t < r} 

exists, by the case a = 0 of Lemma 5.2.1. Since r > 1 and therefore d <, p (xq, K), 
it follows from (5.2) that p(x 0 ,c(K)) exists and equals d. □ 



5.2 Separation Theorems 


115 


Lemma 5.2.4. Let I\ and L be open cones in a normed space X such that K U L 
is dense in X and I\ C ~L. Then 

(a) K C — L and L C —K, 

(b) K U —K and L U —L are dense in X, and 

(c) I\ and L have a common boundary — namely, K n L. 

If also L = {— x : x £ K}, then d K is a subspace of X. 

Proof. Since L C C ~A", we see that K U ~AT is dense; whence, by Lemma 
5.1.4, KU—K is dense. On the other hand, K C ~A and K is open, so K C —L and 
therefore K C —L. But —L is open and K U L is dense, so —L C K and therefore, 
by Lemma 5.1.4, = — L C K. Hence 

K = = XJl. 

Interchanging the roles of K and L, we obtain L U — L dense and 

L = ^K = ^K. 


We now have 

dK = K n~~K = K nL = ~~Lr\L = dL. 

Since K, L, and therefore their closures are all cones, so is OK. 

Now suppose also that L = {— x : x € A'}, and consider x £ dK = ATfl L. There 
exist points y £ K and z £ L arbitrarily close to x. Then — y £ L, —z £ K, and 
these two points are arbitrarily close to — x\ whence —x belongs to dK. It follows 
that dK is a subspace of X. □ 


Lemma 5.2.5. Let K be a located subset of a normed space X , and let r > 0. Then 
the set 

K r = {x £ X : p[x, K) cf r} 
is located, and for each xo £ X, 

p{ xq, K r ) = max {0, p(x o, K) — r} . (5-3) 

Proof. If x £ K r and £ > 0, then there exists y £ K such that ||ar — y\\ < r + e\ so 

||*o - *11 > ll*o - 2/|| - II* - 2/11 > d(* o, K)-r-e. 

Since £ > 0 is arbitrary, we have 


||*o - *|| ^ p(*o> K) - r 


(x £ K r ) . 


(5.4) 
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On the other hand, choosing z £ K such that ||a;o — z\\ < p(xo,K) + e, we have 
either ||aro — z\\ < r, in which case (5.3) holds with each side equal to 0, or else 
1 1 Xq — z 1 1 >0. In the latter case, writing 




we have z + C £ K r and 


11*0 _ ( z + Oil — 


1 — min < 1 


11*0 - 2 : 

= max {0, ||x 0 - z\\ - r} 

< max {0, p[x o, K) — r} + £ 


(*o - z) 


Since £ > 0 is arbitrary, it follows from this and (5.4) that (5.3) holds. 


□ 


By a half-space of a normed space X we mean a convex subset K such that dK 
is a hyperplane and the set 


{x £ X : x £ K V —x £ A'} 


is dense in X. 

We are now ready for the basic separation theorem. Our proof illustrates an 
important observation about classical proofs using Zorn’s lemma (see [65]): for 
separable spaces it is often possible to replace such a proof by a constructive one 
that uses an induction argument. 

Theorem 5.2.6. Let X be a separable normed space, Kq a bounded, located, open, 
convex subset of X such that p(0, Kq) > 0, and x$ a point of X such that —Xo £ Kq. 
Then there exists an open half-space K of X such that Kq C K, p(xo,K) > 0, and 
the boundary of K is a located subspace of X that is a hyperplane with associated 
vector xq. 

Proof. Let (. x n ) n>1 be a dense sequence in X. The basic idea of the proof is this: we 
carry out a succession of located convex enlargements of Kq such that for n ^ 1 , the 
cone generated by the nth enlargement K n is close to at least one of the points x n 
and — x n , and such that the union of the cones c(K n ) is the desired open half-space. 
The idea may seem simple, but the details are rather complicated. 

To be more precise, we construct bounded, located, open, convex subsets Ao C 
A'i C AT 2 C • • • of X such that the following properties obtain for n ^ 1: 


(a) p(x o, c(K n )) > (1 - 2 n )p (x 0 , c(K n _i)) 

(b) p (0, K n ) > 0, 
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(c) max {p (x n , c(K n )) , p (—x n , c(K n ))} < 

n 

To that end, assume that K 0 , . . . , K n _\ have been constructed with the applicable 
properties. Then c(K n _ i) is located, by Lemma 5.2.3. Either 

min {p(x n , c(K n _i)), p(—x n , c(K n _i))} < -, 

n 

in which case we take K n = K n _\, or else both the numbers p(x n ,c(K n _i)) and 
p{— x n , c(K n -i)) are greater than l/2n. In the latter case, writing 

I<+ = {tx + (1 - t) x n : 0 < t < 1, x £ K n _i} , 

K~ = {tx — (1 — t) x n : 0 < t < 1, x € K n _ij , 

we obtain two bounded, open, convex sets that, by Lemma 5.2.1, are located. We 
show that at least one of the sets Jv+, K~ satisfies our requirements for K n . Since 
K n _ i is open, for each x £ AT„_ i we can find A > 1 such that Xx — (A — 1) x n £ 
K n - 1 ; so, by convexity, 

1 ~~ 1) Xn + \ Xn ) e K n- 

Hence K n _ 1 C A'+ : likewise, K n _ 1 C K~ . Since, by our hypotheses, p(Q, K n _i) > 
0 and p(—x n ,c(K n -i)) > 1/2 n, we see from Lemma 5.2.2 that p(0,AT+) > 0; 
similarly, p{0,K~) > 0. 

Next, given z + £ c(A"+) and z - £ c(K~), find t + ,tr > 0 and x + ,x~ £ c(K n _ i) 
such that z + = t + x n + x + and = —t~x n + x~ . We have 

t~ 1 1 .tq — z + 1| + t + 1 1 So — || 

= t~ || £Co - ( t + x n + X + ) || + t + ||xo — (—t~x n + x ~ ) || 

^ || (t + + t ~ ) Xo — ( t~x + + t + x ~ ) || 

= (t+ + t - ) ||*o - 

> (t + + t~) P (x 0 , c(lv n _i)) 

> ( t + + t~) (1 - 2~ n ~ 1 ) p (x 0 , c(K n - 1 )) . 

Hence either 

r (||x 0 - z + \\ - (l - 2 _n ~ 1 ) p (x 0 , c(A' n _i))) > 0 
or 

t + ( 1 1 xq - z~\\ - (l - 2~ n ~ 1 ) p (x 0 , c(AT„_i))) > 0, 
from which we obtain 

max { ||xo - z + \\ , ||x 0 - z“||} > (l - 2~ n ~ 1 ) p (x 0 , c(AT„_i)) . 
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Taking the infimum as z + ,z~ run over c(K+), c(K~) respectively, we obtain 

max {p(x 0 , c(K+)), p(xo, c(K~ ))} ^ (l - 2~ n ~ 1 ) p (x 0 , c(A'„_i)) 

> (l — 2~ n ) p (xq, c(K n _i)) . 

Thus either 

p(x 0 ,c(K+)) > (l - 2~ n ) p(x 0 ,c(K n - 1 )) , 

in which case, noting that x n £ Kn and therefore p (x n , c(K+)) = 0 < 1/n, we 
take K n = K + ; or else 

p(x 0 ,c(K~)) > (l - 2~ n ) p(x 0 ,c(K n _i )) , 

when we take K n = K~ . This completes the inductive construction of K n . 

The set 

OO 

I<= \Jc(K n ) 

n— 0 

is easily shown to be an open convex cone. Moreover, we see from (a) that 
(p(xo, c(K n ))) n>1 is a Cauchy sequence, which therefore converges in R. Since we 
have c(iC„_i) C c(K n ) for each n, the limit of the sequence is p(xo,K). Now, for 
n ^ 1, 

fc = 1 V fc = 2 / 

so, by (a), 

p(x 0 ,K) = lim p(x 0 ,c(K n )) ^ -p(x 0 ,c(K 0 )) ■ 

n— >■ oo 4 

But by the convexity of Kq, 


P (x 0 , c(Kq)) = inf {||ta - x 0 || : t > 0, x G K 0 } 


= inf | (t + 1) 
^ p(0,K o ). 


t 


1 


t + 1 


t + 1 


(-z 0 ) 


: t > 0, x £ K( 


Hence p(x 0 ,K) > 0. 

Let 

L = {x £ X : —x £ K} . 

Then L is an open convex cone containing Xq. By (c), for each n, either p ( x n , K) < 
1/n or p ( x n , L) < 1/n. So K U L is dense in X. On the other hand, I\ C for if 
x £ K and y £ L, then there exists n such that x £ c (K n ) and —y £ c ( K n )\ so there 
exist t > 0 and z £ K n such that x — y = tz and therefore \\x — y\\ ^ tp( 0, K n ) > 0. 
It now follows from Lemma 5.2.4 that L U —L is dense in X, and the common 
boundary of K and L is the set 
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TV = K n L, 


which is a linear subspace of X. 

To prove that TV is a hyperplane, let 7 denote the boundary crossing map of 
L relative to its interior point Xq. Note that, by Lemma 5.2.4, K C — L\ so, by 
Proposition 5.1.5, for each x € K there exists a unique t € (0, 1) such that 

7 (a;) = tx 0 + (1 — t) x 

and therefore 

x = ih + 

Similar considerations using the boundary crossing map of K relative to —Xo com- 
plete a proof that each x € K U L can be expressed in the form axo + /3z with a € I 
and 2 G TV. In fact, such an expression obtains for all x € X. For since K U L is 
dense, there exist a sequence (ct n ) n ^ 1 hr R and a sequence (z n ) n ^ 1 in TV such that 


a n XQ + z n — > x as n — > 00 . Given e > 0 and m > n, we have either |a m — a. 

or \a m — a n \ >0; in the latter case, since z m — z 

n e TV c K, 

\\(a m x 0 + z m ) - (a n xo + z n )\\ = \a m - a n \ 

Xo . , (z m Zn) 

| ®-n | 

^ |^m | p(%0 5 -^0 ? 


so 




1 


p(xo,K) 


|| (&m X 0 + Zm ) - (a n x 0 + z n ) || . 


It follows that (oL n ) n> 1 is a Cauchy sequence and therefore converges to a limit a 
in R. Writing z = x — axo, we have 


\z - z n || < II® - (a n x 0 + z n ) || + \a- a n \ ||x 0 | 
— > 0 as n — > 00, 


so x € TV = TV. Thus TV is a hyperplane with associated vector xq, and therefore 
(since also K U L is dense in X) K is a half-space. 

Now, for each x € K we have j(x) € TV C K and ||®o — 7 (®)|| < ||®o — x\\ • 
It follows that p(xo,N) exists, equals p(xo,K), and is positive. For each x € X, 
choosing a G R and z G TV such that x = ax 0 + z, we easily see that p (x, TV) exists 
and equals |a| p(x 0, K). Hence TV is located. □ 


Corollary 5.2.7. Under the hypotheses of Theorem 5.2.6, there exists a normed 
linear functional u on X such that ||u|| = 1 and Ren is positive on Kq. 


Proof. First take the case IK = R. Let K be as in the conclusion of Theorem 5.2.6 
let 
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L = {— x : x G K} , 

recall that K C —L, and let N be the common boundary of I\ and L. Since N is 
a located hyperplane, we see from Propositions 2.3.4 and 2.3.6 that there exists a 
normed linear functional v on X such that ker(u) = N and v(xq) = 1. Defining 
u = — |M| v, we obtain a normed linear functional on X with norm 1 such that 
ker(u) = N and u{xq) < 0. Let 7 be the boundary crossing map of L relative to xq. 
Given x G K, choose A G (0, 1) such that 7(2;) = \x + (1 — A) xq. Then ^(7(2;)) = 0, 

so 1 A 

u( x) = — u(x 0) > 0. 

A 

Hence u is positive on K and therefore on Kq. 

Now consider the case K = C. By the case just considered, there exists a real 
linear functional v : X — > R. on the real linear space X such that v has norm 1 
and is positive on Kq ■ The following lemma then shows that 

u(x) = v(x) — iu(i2;) 

defines a linear functional u : X — > C on the complex linear space X such that 
||u|| = 1 and Reu is positive on K 0 . □ 


Lemma 5.2.8. If X is a normed linear space over C, then there is a one-one cor- 
respondence between real linear functionals v on X and complex linear functionals 
u on X, given by 

u{ x) = v(x) — iv(ix). (5-5) 

Moreover, if either u or v is normed, then both are and their norms are equal. 

Proof. If u is a complex linear functional, then u = v + iw for unique real linear 
functionals v and w. But 

iu(2;) — w(x) = iu(x ) = u(ix) = v(ix) + iw(ix), 


so w(x) = — v(ix). 

Conversely, if v is a real linear functional, then (5.5) defines a linear functional u 
that respects multiplication by real numbers; it is easily checked that u(i2;) = iu(x). 
The final conclusion, about the norms of u and v, is a straightforward consequence 
of the identity 

M*)| = sup{|u(A2;)| : |A| = 1}, (5.6) 

which we now establish. If |A| = 1, then 

|r*(cc) | = | Au(2r) I = |u(Ax)| ^ |Reu(Ax)| = |u(Ax)| . 

On the other hand, if £ > 0, then either |u(x)| — e < |u(x)|, or else u(x) ^ 0. In the 
latter case, if we let A = \u{x)\/u{x), then 
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u( Xx) = A u(x) = |w(a;)| £ R, 

so |w(;r)| = Re'u(Ax) = v(Ax). Notice that in this case the supremum in (5.6) is 
achieved. □ 


The following result is the full form of the separation theorem. 

Theorem 5.2.9. Let A and B be bounded convex subsets of a separable normed 
space X such that the algebraic difference 

{y — x : x £ A, y £ B} 

is located and the mutual distance 

d = inf {||y — x|| : x £ A, y £ B} 

is positive. Then for each e > 0 there exists a normed linear functional u on X , 
with norm 1, such that 

Re u(y) > Re u(x) + d — e (x £ A, y £ B) . 

Proof. We may assume that e < d. Write 

K 0 = {y - x - z : x £ A, y £ B, ||z|| < d- , 

which is bounded, open, and convex. It follows from Lemma 5.2.5 that Kq is lo- 
cated and p(0,I\ 0 ) = e/2. Hence, by Corollary 5.2.7, there exists a normed linear 
functional u on X such that ||u|| = 1 and Rett is positive on Kq. Choose z £ X 
such that ||,z|| < d — e/2 and u(z) > d — e; this is possible since ||it|| = 1. For all 
x £ A and y £ B we have y — x — z £ Kq and so Re u(y — x — z) >0; whence 

Re u(y) > Re (u(x) + u(z)) > Re u(x) + d — e, 


as required. □ 


An inhabited subset K of a normed space X is said to be balanced if ax £ K 
whenever x £ K and |a| ^ 1; in that case, 0 £ K. 

Corollary 5.2.10. Let Xq be a vector in a separable normed space X, and K a 
located subset of X that is bounded, convex, and balanced, such that p(xo,K) > 0. 
Then for each e > 0 there exists a normed linear functional u on X with norm 1 
such that 

u(x 0 ) > |zt(a;)| + p(x 0 , K) - e 

for all x £ K . 
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Proof. We may assume that 0 < £ < p(x o, K). Applying Theorem 5.2.9 with A = I\ 
and B = {xo}, construct a normed linear functional v on X with norm 1 such that 

Re u(xo) > Re v(x) + p(x o, K) — e (x £ K) . 

Since 0 € K, we have v{xq) yf 0. Thus 

u(x 0 ) 

is a normed linear functional on X with norm 1 such that 

u(x o) = |u(xo)| ^ Reu(xo) > Reu(x) + p(x o, K) — e (x £ K) . 

For each x £ K, either u(x o) > |«(x)| + p(x o, K) — e, or else u(x ) yf 0 and therefore 
v(x ) yf 0. In the latter case, since K is balanced, 


;(a 


and therefore 


u{x o) > Re v 


v(x) 
v{x)\ 


X £ K , 


v(x) 

= |v(a:)| + p(x 0 ,K ) - s 
= \u(x)\ + p(xo, K) - £, 


x + p(x o, K) — e 


as required. 


□ 


5.3 The Hahn— Banach Theorem 

The Hahn-Banach theorem enables us to extend a normed linear functional, with an 
arbitrarily small increase in norm, from a subspace of a normed space to the entire 
space. This fundamental result has numerous applications throughout functional 
analysis. 

In the constructive context we deal only with the extension of linear functionals 
on subspaces of a separable normed space. The standard classical proofs extend- 
ing the theorem to nonseparable normed spaces depend on Zorn’s lemma and are 
therefore nonconstructive. 

We begin with two more consequences of the separation theorem. 

Proposition 5.3.1. Let x be an element of a nontrivial separable normed space 
X, and let £ > 0. Then there exists a normed linear functional u on X such that 
||u|| = 1 and u(x) > ||x|| — e. 
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Proof. If x ^ 0, then we may apply the separation theorem (Theorem 5.2.9) with 
A = {0} and B = {cc} . In the general case, choose a nonzero vector y such that 
1 1 x — y 1 1 < e/2, and then construct a normed linear functional u on X such that 
\\u\\ = 1 and u(y) > ||y|| — e/2. We have 

u{x) ^ u(y) - | u(x) - u(y)\ > \\y\\ - | - ||x - y\\ > ||a;|| - e, 
as required. □ 


Corollary 5.3.2. Let V be a located subspace of a separable normed space X such 
that the metric complement —V is inhabited. Then there exists a normed linear 
functional u on X such that ||u|| = 1 and u(V) = {0} . 

Proof. Let e be a unit vector in —V. Applying Proposition 5.3.1 in the quotient 
space X/V. construct a normed linear functional u on X/V such that ||tt|| = 1 and 
u(v) ^ 0. Clearly, V C ker(u). Since || || X j V < || || , we see that u, regarded as a 
linear functional on the original normed space X 1 has bound 1. On the other hand, 
given s > 0, we can find x € X with ||a;|| A -/y < 1 and u(x) > 1 — e; choosing 
y € V with ||x — y\\ < 1, we then have u{x — y) = u(x) > 1 — e. Since £ > 0 is 
arbitrary, it follows that, as a linear functional on X, u is normed and has norm 1. □ 


Proposition 5.3.1 is crucial for our proof of the Hahn-Banach theorem. 

Theorem 5.3.3. Let v be a nonzero bounded linear functional on a linear subset 
Y of a separable normed linear space X such that ker(u) is located in X. Then for 
each £ > 0 there exists a normed linear functional u on X such that ||tt|| ^ ||i>|| +£ 
and u(y) = v(y) for each y £ Y. 


Proof. First note that, by Proposition 2.3.6, v is normed as a linear functional on 
Y. Fix yo in Y with v(yo) = 1. Then for each x £ ker(u), 

1 1 2/o - *|| > JiKv{yo -x)= 1 

M IMI 

from which it follows that 

ll2/o||jr/ker(v) = p(y 0 ,ker(u)) ^ pp 

For each normed linear functional u on X/ker(u) denote the norm by |it|| x / ker („) . 
Using Proposition 5.3.1, construct a normed linear functional uq on X/ker (v) such 
that ||uo||x/ker(„) = 1 and Mvo) > l/GMI +£)■ Then 


uo (yo) 


u 0 


u = 
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is a normed linear functional on X/ker(u) such that ||u|| Y/ker(«) < Mil + £ and 
u (yo) = 1- Since || ||x/ker(jU ^ II II on M we see that regarded as a linear 
functional on the original normed space X, has bound Mix/ ker (u) • To see that 
this is actually the norm of u on X, consider any S > 0 and choose x £ X such 
that IMIx/ker/u) < 1 and u ( x ) > ll u IM/ker(v) “ <*• Finding y G ker(u) such that 
||x — y|| < 1, we have 

u(x -y) = u{x) > |M| x/ ker(„) ~ S - 
Since S > 0 is arbitrary, we conclude that u is normed on X 1 with 

IMI = MIM/ker(v) < Mil +«• 

Finally, for each y G Y we have y — v(y)yo G ker(u), so 

0 = u(y - v(y)y 0 ) = u(y ) - v(y), 

and therefore u{y) = v(y). □ 


The Hahn-Banach theorem and some of its associates have surprising applica- 
tions. As surprising as any is the following, whose classical proof is almost trivial. 

Proposition 5.3.4. Let Xi , . . . , x n be elements of an infinite- dimensional normed 
space X, and let e > 0. Then there exist linearly independent elements e\, . . . , e n of 
X such that \\xi — e»|| < e for each i. 

Proof. Construct a finite-dimensional subspace V of span {24 , . . . , x n } such that 
for each i there exists y t G V with ||xj — Ml < e /2 ( see Lemma 4.1.11). Since 
X is infinite-dimensional and V is finite-dimensional, we can embed V in an n- 
dimensional subspace W of X. We may assume that y\ yf 0. Setting e\ = y \ , 
suppose that for some k < n, we have constructed linearly independent elements 
ei, . . . , efc of W such that |M — e*|| < e/2 for 1 < i ^ k. Let V/ be the fc-dimensional 
subspace of W with basis {ei, . . . ,efc} . By Corollary 5.3.2, there exists a normed 
linear functional u on W such that u(Vk) = {0} and ||tt|| = 1. Construct a vec- 
tor z G W such that ||z|| = e/2 and u(z) > e/3. Either u(yk+ 1) yf 0 or else 
\u(yk+i)\ < e/3. In the first case, p(yk+ i,Vfc) > 0 and we set ek+i = 2/fc+i- I n the 
second case, u(y k + 1 - z) yf 0, p (y k + 1 - z, V k ) > 0, and we set e fc +i = y k + 1 - z. In 
each case, the vectors e ±, . . . ,e k +i are linearly independent, by Lemma 4.1.10, and 
Mfc+i — efc+i|| < e. This completes the inductive construction. □ 


The classical Hahn-Banach theorem says that we can extend a bounded linear 
functional v from Y to a functional u on the whole space X with exact preservation 
of norm: that is, ||it|| = Mil • I n general, as Exercise 5 shows, we cannot do this 
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constructively. But, as we shall see, if we impose extra conditions on the norm of 
X, then we can make the extension norm-preserving. 

A mapping u of a linear space X into R is said to be convex if 
u (Ax + (1 — A) y) ^ Au(x) + (1 — A) u(y) 
whenever x, y £ X and 0 ^ A ^ 1. 

Lemma 5.3.5. Let u be a convex mapping of a linear space X into R such that 
u(x ) = —u(—x) for each x £ X. Then u is linear. 

Proof. First note that u( 0) = — u(— 0) = — u(0), so u( 0) = 0. For each x € X and 
for 0 ^ A ^ 1 we have 

u( Ax) = u( Ax + (1 — A) 0) ^ Au(x) + (1 — A) u(0) = A u(x). 

Replacing x by —x, we obtain u(— Ax) ^ A u(—x)] whence, by our hypotheses on u, 
— tt(Ax) ^ —Au(x) and therefore A u(x) ^ u(Ax). Hence u( Ax) = A u(x) whenever 
0 < A < 1. 

Now consider any real A yf 0. If A > 0, then by the foregoing, 

and therefore A u(x) = u( Ax). If A < 0, then 

u( Ax) = u((—A)(—x)) = —Au(—x) = A(—u(—x)) = A u(x). 

Now consider any A £ R, and assume that u( Ax) yf Au(x). The foregoing shows 
that 

-> (A > 0 V A < 0) 

and therefore A = 0. But this is absurd, since u(0x) = 0 = 0u(x). We conclude that 
u(Ax) = Au(x) for all A £ R. 

It remains to prove the additivity of u. By the convexity of u, 
u QO + V)J < \ u ( x ) + \u(y). 

Applying the first part of the proof yields u(x + y) ^ u(x) + u(y). We now replace 
x, y by —x, —y respectively, to obtain 

u(—(x + y)) = u(—x — y) ^ u(—x) + u(—y) 

and therefore, by the hypotheses on u, u(x + y) ^ u(x) + u(y). Hence u(x + y) = 
u(x) + u(y). □ 
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The norm of a normed space X is said to be Gateaux differentiable at x £ X 
if for each y £ X the limit 


u x (y) = lim 


\\x + ty\\ - ||x|| 
t 


exists. For each c > 0 we then have 

t \ \\x + c-Hy\\ - ||®|| 

u C x{y) = lim = u x {y). 

If the norm of X is Gateaux differentiable at each nonzero point of X , we say that 
X has Gateaux differentiable norm. In such a case we obtain a strengthening of 
Proposition 5.3.1. 


Proposition 5.3.6. Let x be a nonzero vector in a normed space X , and suppose 
that the norm of X is Gateaux differentiable at x. Then the mapping u x is the 
unique normed linear functional u on X with ||u|| = 1 and u(x) = ||x|| . 


Proof. It is enough to take the case K = K. It is straightforward to show that u x is 
convex, and that u x (y) = — u x {—y ) for each y £ X; whence, by Lemma 5.3.5, u x is 
linear. Moreover, \u x (y)\ ^ ||r/|| and u x (x) = ||x||. So ||it x || = 1, and it only remains 
to prove the uniqueness. To this end, let u be any normed linear functional on X 
such that ||u|| = 1 and u( x) = ||x|| . For each y £ X and each t > 0 we have 

II® - ty || - ||x|| u(x - ty) - u(x) u(-ty) 

: % : = : — = u(y) 


and 


u(y) 


u(x + ty) - u(x) II® + ty || - ||®|| 
t ^ t 


Keeping y fixed and letting t — > 0 in these two sets of displayed inequalities, we 
obtain u x (y) < u(y) < u x (y); whence u(y) = u x (y). □ 


Proposition 5.3.7. Let Y be a subspace of a uniformly convex Banach space X 
with Gateaux differentiable norm, and let v be a nonzero normed linear functional on 
Y. Then there exists a unique normed linear functional u on X such that ||tt|| = ||i>|| 
and u(y) = v{y) for each y £ Y. 

Proof. We may assume that ||t>|| = 1. Using Exercise 19 of Chapter 2 if necessary, 
we may further assume that Y is closed in X and is therefore a Banach space. 
Since Y is uniformly convex, it follows from Proposition 2.3.7 that there exists a 
unique a £ Y such that ||a|| = 1 = v(a). By Proposition 5.3.6, u a is a normed 
linear functional on X such that ||u a || = 1 = u a (a). Applying the uniqueness part 
of Proposition 5.3.6 in the space Y. we conclude that u a (y) = v(y) for all y £ Y. 
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Finally, if u is a normed linear functional on X such that ||u|| = ||u|| and u(y) = v(y) 
for all y G Y, then 

u(a) = v(a) = 1 = |M| = H| , 

so, again by the uniqueness part of Proposition 5.3.6, u = u a ■ □ 

Not surprisingly, there is also a strong version of the separation theorem that 
applies in a uniformly convex separable Banach space; see Exercises 9 and 10. 


5.4 Locally Convex Spaces 

Although Errett Bishop considered that 

in most cases of interest it seems to be unnecessary to make use of any deep 
facts from the general theory of locally convex spaces ([9], page 350), 

the development of constructive analysis (in particular, aspects of the theory of 
operators) in recent years has greatly benefited from such a general theory, which 
we now present. 

A locally convex space consists of a linear space X over K, a family (pj / 
of seminorms on X , and the equality and (compatible, tight) inequality relations 
defined by 

x = y <=> ViGl {pi (x-y) = 0) , 
x^y 4=4- 3iG I ( pi (x-y) > 0) . 

Following normal practice, we call X itself a locally convex space when it is clear 
which family of seminorms is under consideration. The family ( Pi) ie j and the as- 
sociated equality and inequality together form the locally convex structure on X. 
The corresponding locally convex topology on X is the family tx of all subsets of 
X that are unions of sets of the form 

U(a, F, e) = < x € X : ^ pi (x — a) < e 

l i<EF 

where a G X, F is an inhabited finitely enumerable subset of /, and e > 0. The 
seminorms Pi ( i G I) are called the defining seminorms of t x ■ The members of tx 
are called open subsets of X, and the sets U (a, F, e) basic neighbourhoods of a. On 
the other hand, if S' is a subset of X , then its closure S is the set of all elements x 
of X such that S nU(x, F,e) is inhabited for all finitely enumerable F C I and all 
e > 0. We say that S is closed (in the locally convex topology Tx) if S = S, and 
that S is dense in X if S = X. The unit ball of the locally convex space X is 


{x G X : Vi G I ( Pi (x) < 1)} , 
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which is a closed subset of X. 

We regard a normed space ( X , || ||) as a locally convex space in which the family 
of seminorms consists of the single norm || ||. In that case, the locally convex topol- 
ogy is just the metric topology associated with the norm on X, and the various 
notions of convergence, total boundedness, locatedness, and so on arising from the 
locally convex structure on X are precisely those associated with the norm on X. 

In the following, unless we state otherwise, (X, (pi) i€l ) and (y, ( Qj)j e j) are 
locally convex spaces over K. A mapping / of a subset S of X into Y is 

> continuous at a € S if for each e > 0 and each finitely enumerable G C J, 
there exist S > 0 and a finitely enumerable F C I such that if x £ S and 
E Pi(x~a) < S, then E <lj(f( x ) - f ( a )) < £ ; 

ieF jeG 

> continuous on S if it is continuous at each point of X; 

> uniformly continuous on S if for each e > 0 and each finitely enumerable subset 
G of J, there exist 6 > 0 and a finitely enumerable subset F of / such that if 

x,y G S and E Pi( x ~ u) < ^ then E ~ f(y)) < £ - 

ieF jeG 

Notice that each of the defining seminorms pi on X is uniformly continuous on X. 


Proposition 5.4.1. The following are equivalent conditions on a linear mapping u 
between the locally convex spaces X and Y: 


(a) u is continuous at 0. 

(b) u is continuous on X . 

(c) u is uniformly continuous on X. 

(d) There exist a positive real number C and a finitely enumerable subset F of I 
such that 

|«( x )l ^ C sup ^ (a;) 

ieF 


for each x € X. 


Proof. It is routine to show that (d) ==> (c) ==> (b) =£> (a). Suppose that u is 
continuous at 0. There exist 5 > 0 and a finitely enumerable subset F of I such 

that if E Pi ( x ) < then |u(x)| < 1. It follows that for each x € X and each £ > 0, 
ieF 


u 


Sx 


E Pi( x ) + e 

^ ieF 


< 1 
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and therefore 


M®) I < 


1 

6 



Since e > 0 is arbitrary, it follows that 


Ms) | < 


i&F 


Thus (a) 


(d). 


□ 


Naturally, in view of the analogous situation in a normed space, we say that a 
mapping u satisfying one, and hence all, of conditions (a)-(d) of Proposition 5.4.1 
is a bounded linear mapping. 

We say that a sequence (x n ) n>1 in a locally convex space (X, (Pi) ieI ) 

► converges to a limit x in X if for each basic neighbourhood U = U(x, F, e) of x 
in X there exists n o such that x n £ U whenever n ^ hq\ 

► is a Cauchy sequence if for each e > 0 and each finitely enumerable subset F of 
/, there exists no such that Pi{x m — x n ) < e whenever m, n ^ n q. 

idF 

We say that X is complete if every Cauchy sequence in X converges to a limit in 
X. (Strictly speaking, this definition applies only when X is separable — that is, has 
a countable dense subset. However, it will suffice for our purposes.) 

Let S' be a subset of X, F a finitely enumerable subset of I, and £ > 0. By an 
e-approximation to S relative to F we mean a subset T of S such that for each 
x € X there exists y £ T with Pi{x — y) < e. We say that S is 

i£F 

> totally bounded relative to F if for each £ > 0 there exists a finitely enumerable 
£-approximation to S relative to F; 

> totally bounded if it is totally bounded relative to each finitely enumerable subset 
of /. 

The proofs of the next five results are similar to those of their counterparts in 
metric space theory (see Chapter 2, Section 2) and are left as exercises. 

Proposition 5.4.2. If S is a totally bounded subset of a locally convex space, and 
f is a uniformly continuous mapping of S into a locally convex space Y, then f(S) 
is totally bounded. 
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Proposition 5.4.3. If S is a totally bounded subset of a locally convex space, and 

f is a uniformly continuous mapping of S into R, then sup /(a;) and inf f{x) exist. 

xes x ^ s 


A subset S of the locally convex space (X, (pi) ieI ) is said to be located (in X) 


if 


inf < Pi (x - y) ■ y £ S 

UeF 

exists for each x £ X and each finitely enumerable subset F of I. As it does in 
metric spaces, locatedness plays an important role in locally convex spaces. 

Proposition 5.4.4. A totally bounded subset of a locally convex space is located. 


Proposition 5.4.5. A located subset of a totally bounded set in a locally convex 
space is totally bounded. 

Theorem 5.4.6. Let S be a totally bounded subset of a locally convex space X , and 
let f be a uniformly continuous mapping of S into R. Then for all but countably 
many t > inf f(x), the set 

{x £ S : f(x) ^ t} 

is totally bounded. 

Now consider the set B(X, Y) of all bounded linear mappings between the locally 
convex spaces X and Y. This set, which we often denote by B when it is clear which 
spaces X and Y are under consideration, becomes a locally convex space when 
taken with pointwise addition and multiplication-by-scalars and endowed with the 
seminorms p x defined by 

p x (T) = \\Tx\\ (x £ X, T £ B(X,Y)) . 

We denote the unit ball of B(X,Y) by B\ (X,Y) or just B\. When X = Y, we 
usually write B(X) and B\{X) rather than B(X,Y) and B 1 (X,Y). 

In the special case where Y is the ground field K, we obtain the space of all 
bounded linear functionals on X; this space is called the dual space of X, and is 
denoted by X* or sometimes, for clarity, by (X, ( Pi) ieI ) ; its unit ball is denoted 
by X*. The topology associated with the family of seminorms (p x ) xeX on X* is 
called the weak* topology on X*. When we are dealing with, for example, total 
boundedness relative to the locally convex structure on X*, we speak of weak* - 
total boundedness. 

In classical analysis, when X and Y are normed spaces, a big role is played by 
the operator norm 

ll T ll = sup{||Ta;|| : x £ X, ||z|| < 1} 

of an element T of B(X, Y). Constructively the operator norm exists only for those 
elements T that are normed; but Corollary 4.1.8 shows that it exists for all T £ 
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B(X,Y) when X is finite-dimensional. In that case, X* is algebraically isomorphic 
to X, every element of X* is normed, the weak* topology on X* coincides with 
the topology associated with the operator norm, and X* is a Banach space relative 
to the operator norm; these facts are easily verified using results from Section 1 of 
Chapter 4. 

We now examine the unit ball of the dual space. 

Theorem 5.4.7. ( Banach Alaoglu theorem) If X is a separable normed space, 
then Xi is complete and totally bounded relative to the locally convex structure on 
X*. 

Proof. To prove the weak*-completeness, let (v„) n>1 be a Cauchy sequence in X*. 
Given £ > 0 and a finitely enumerable subset F of X , we can find N such that 

| (vm ~ ^n)(aQ| < g (m,n^N). (5.7) 

xeF 

It readily follows that for each x £ X, (v n (x)) n>1 is a Cauchy sequence in IK. Since 
IK is complete, the limit v(x) of the sequence (v n (x)) n>1 exists in IK for each x € X. 
It is straightforward to show that the mapping v : X — > IK so defined is linear and 
belongs to X). Finally, keeping n ^ N fixed and taking the limit with respect to 
m in (5.7), we obtain 

|(p - v„) (x) | < £ (n > N) . 

xeF 

Since £, F are arbitrary, the sequence {v n ) n>1 converges to v in the weak* topology. 

To prove the weak*-total boundedness of Xf, let F = {aq, . . . , x m } be a finitely 
enumerable subset of X, let 

M > 4 + max{||xi|| : 1 < i < to} , 

and let 0 < £ < 1. By Lemma 4.1.11, there exists a finite-dimensional subspace Xq 
of X such that for 1 ^ i ^ to, p(xi,X o) < e/m and therefore there exists y, £ Xq 
with || xt — yi || < e/m. If X 0 = {0}, then for each v £ Xf, 

m m 

k^) - u (°)i ^ ii^ii < £ > 

i= 1 i= 1 

so {0} is an ^approximation to Xf relative to F. We may therefore assume that Xq 
has positive dimension. Then every element of Xq (the dual of Xo) is normed, and 
Xq, taken with the operator norm, is a finite-dimensional Banach space. Hence its 
unit ball is compact relative to the operator norm (Corollary 4.1.7). Moreover, by 
Proposition 2.3.6, each nonzero element of Xq has its kernel located in X 0 ; since X 0 
is locally compact, this kernel is locally compact and hence is located in the space 
X (Proposition 2.2.18). It follows that the Hahn-Banach theorem can be applied 
to extend nonzero bounded linear functionals from Xq to X. 
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Let { Ui, ■ ■ ■ > u n } be an e/m-approximation to the unit ball of Xq in the operator 
norm such that 0 < \\u k \\ <1 for each k. By the Hahn-Banach theorem, there exist 
normed linear functionals «i, . . . , u n in X* such that Uf~{x) = u k (x ) for each x € X 0 
and each k. Given u £ X since the restriction of u to X 0 is in the unit ball of 
Xq, we can find k such that |u(;r) — ■u°.(:r)| < e/m for all x £ X 0 with ||a;|| ^ 1. For 
each i (1 ^ i ^ m) we have 


U k {Xi) 1 < 

10- 

< 

2 || Xi 


2e 

< 

b 


m 


Ms 

< 

. 


m 


(u - u° k ) 


1 + II Vi 


\ £ 
m) m 


Hence 

m 

Y. I u(xi) - u k {xi) I < Me. (5.8) 

i—\ 

We now see that {u ±, . . . , u n } is an Me- approximation to XJ relative to F. □ 


Corollary 5.4.8. If X is a separable normed space, then the normed linear func- 
tionals on X are weak* -dense in X*. 

Proof. In the notation of the last paragraph of the foregoing proof, the function Uk 
satisfying (5.8) is normed. Since e > 0 and the finitely enumerable subset F of X 
are arbitrary, the result follows. □ 


Let X be a normed space. It is easy to see that, for a fixed vector x £ X, the 
linear functional u u{x) on X* is weak* -uniformly continuous on X/. We shall 
show that any element of X** (the dual of X*) that is uniformly continuous on 
XJ can be approximated arbitrarily closely by functionals of this special form, and 
that if X is complete, then this approximation can be made exact. 

Lemma 5.4.9. Let V be a locally convex space whose unit ball V\ is totally bounded, 
and let f> be a nonzero linear functional on V that is uniformly continuous on V\. 
Then the unit kernel, V\ flker(0), of f> is totally bounded. 

Proof. Since (j> is nonzero and uniformly continuous on the totally bounded set Vi, 

C = sup{|^(y)| : y £ Vi} 

exists and is positive. Using the linearity of <\>, choose y £ V\ with (p(y) > C/2. 
Then 
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V ° 2 J>(y) y 

belongs to V\, and <j>(yo) = C/2. Let e be a positive number, and 


F = {pi,...,p m } 

a finitely enumerable set of defining seminorms on V. Using Theorem 5.4.6, compute 
a positive number 

Ce 

t < 75 — ; — 

C + 4m 

such that 

S t = {y&V i : \<j>{y)\ < t} 

is totally bounded. Pick a t-approximation {si, . . . , s n } of S t relative to F. Set 

Vk = c +2t ( Sfc ~ (!<*<«)■ 

Then y k £ ker(^). If pi is any defining seminorm on V, then we have 


PiiUk) < 


C + 2t V v y 
C / 2i 
C + 2t l i+ C 


Pi(s k ) + — \<j>{sk)\Pi{yo ) 


So yk belongs to Vi ft ker ((/>). 

Now consider any element y of Vi D ker((/>). Since y € St, there exists k such 

m 

that Pi(y - s k ) < t and therefore 
2=1 

m m m 

^2,Pi{y- y k) < ^2pi(y - s fc ) + ^Piisk - s/fc) 


C + 2t ^ 


^PiitSk + (/>(sk)yo) 


«‘+c£ (Pi(sfc) +P*(j/o)) 


/ 4m. 

1 + -c 


Thus {j/i, . . . , y n } is a finitely enumerable e-approximation to Vi D ker(</>). □ 
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Lemma 5.4.10. Let X be a normed linear space, and <j> a linear functional on X* 
that is uniformly continuous on X /. Then for each £ > 0 there exist 6 > 0 and a 
finite- dimensional subspace X 0 of X such that if u £ X/ and |u(x)| < S for all x 
in the unit ball of X o, then |</>( w )l < £. 

Proof. There exist £i, •••,£„ in X with ||4|| < 1/2 for each k, and a positive 
number 5, such that if u £ X/ and |u(4)| < 2 6 for each k, then |</(u)| < e. 
By Lemma 4.1.11, there exist a finite-dimensional subspace X 0 of X , and vectors 
xi, ... ,x n of Xo, such that 

114 -ttkll < min (l<fc<n). 

Suppose that u G and \u(x)\ < S for all x in the unit ball of Xq. Then for each 

k 

K4)l < K4 - x k )\ + \u(x k )\ < ||4 - X k \\ + 6 < 26 . 

Hence |0(u)| < e. □ 


Lemma 5.4.11. Let X be a separable normed space, Xq a finite- dimensional sub- 
space of X, and <f> « nonzero linear functional on X that is weak* -uniformly con- 
tinuous on X/. Denote the unit kernel of <f> by A f\. Then the following hold: 

(a) ||<^|| = sup {|0(it)| : u £ X/} exists. 

(b) ||x|| 0 = sup{|u(x)| : u £ A/i} defines a seminorm on Xo- 

(c) /3 = inf {||x|| 0 : x £ Xo, ||x|| = 1} is well defined. 

Suppose that j3 > 0. Then 

(d) (Xo, || ||o) is a finite- dimensional Banach space, and 

IMI° = sup{|(u(x)| : x £ X 0 , ||x|| 0 < 1} 
defines the corresponding operator norm on the dual space Xg. 

Finally, for each u £ Afi let F(u) denote the restriction of u to Xg. Then 

(e) F is weak* -uniformly continuous, and ran(F) is || ||°-dense in the unit ball Sq 

of (xs, II 11°) . 

Proof. Since <f> is weak*-uniformly continuous on X* and, by Theorem 5.4.7, the 
latter is weak*-totally bounded, we see that ||<^|| exists. Also, Af\ is weak*-totally 
bounded, by Lemma 5.4.9, and the mapping u u(x) is weak*-uniformly continu- 
ous on X*, so ||x|| 0 exists. Clearly, || || 0 is a seminorm and || || 0 < || ||; whence || || 0 
is uniformly continuous on (Xg, || ||) . Since that space is finite-dimensional, 
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{x GX 0 : ||ai|| = 1} 

is compact, and so p exists. 

Now suppose that (3 > 0. Then || || < /3 _1 || || 0 , so || || 0 is a norm on Xo 
equivalent to || || , and (Xg, || || 0 ) is a finite-dimensional Banach space. The mapping 
F : M — > (XjJ, || ||°) is weak*-uniformly continuous on the weak*-totally bounded 
set Afi, so its range is totally bounded, and hence located, in (Xq, || || ). To show 
that ran(F) is dense in the unit ball Sq of (Xg 5 || ||°), fix u 0 in Sq and suppose that 

0 < c = inf j||w 0 — F(u)||° : u € 7Vi| . 

By Corollary 5.2.10, there exists a normed linear functional on (Xq, || ||°) such 
that 

<P(u 0 ) > |<2>(F(u))| + | (u e M) • 

Now Xq, being finite-dimensional, is equivalent to a Hilbert space. (Recall from 
Corollary 4.1.9 that all norms on a finite-dimensional space are equivalent to each 
other and hence to the Euclidean norm.) It follows from this and the Riesz repre- 
sentation theorem that there exists ( € X 0 such that F(v) = v(() for each v € Xq. 
Therefore 

u 0 (C) > sup{|F(u)(C)| : u G A/i} = sup {KC)| : u G M} = HCIIo - 
This is absurd, since u 0 € Sq. Hence, in fact, c = 0 and ran(F) is || ||°-dense in Sq. □ 

Lemma 5.4.12. With the hypotheses and notation of Lemma 5.4.11 and its proof, 
suppose that ||0| > 1 and let a, 6 be positive numbers. Suppose also that |</>(w)| < 
a/2 whenever (a) u € X { and (b) |u(a;)| < S for all x in the unit ball of Xq. Then 
(3 < a. 

Proof. Choose uq € X { such that <j>(uo) = 1. Since 

\u 0 {(3x)\ = P |u 0 {x) | < P ||x|| < ||;r|| 0 , 

the linear functional 

T : x Uq(Px) 

belongs to Sq. Now, either P < a or P > 0. Assuming the latter inequality, we see 
from Lemma 5.4.11 that there exists u £ Afi such that 

\\F -F(u)||° < 25. 

So for all x in the unit ball of (X 0 , || || 0 ) , 


\uq(Px) — u(x) | < 26 



136 5 Linearity and Convexity 


and therefore 


1 

2 


(f3u 0 - u) ( x ) 


< (5. 


But /? < 1, and both w 0 and —u belong to the convex set X/, so \ (0uo — u) £ X*; 
whence 

<t> Q (Ko - 

Therefore 

P = KOo) - <t>( u ) = 2<t> Q (fiuo - u)^j < a, 
as we required. □ 



Proposition 5.4.13. Let X be a separable normed space, and <fi a linear functional 
on X* that is weak* -uniformly continuous on X/. Then for each e > 0 there exists 
x € X such that ||a;|| < 3 ||</>|| and 

| (j>{u) — m ( x )| < £ (u £ XI) . 

Proof. If ||</>|| < £, then we can take x = 0; so we may assume that ||</|| > 0. Scaling 
if necessary, we may also assume that there exists uq £ X* with 4>(uq) = 1. Pick 
a positive number a , which we shall specify further as the proof develops. Using 
Lemma 5.4.10, construct a finite-dimensional subspace Xo of X, and a positive 
number 5, such that if u £ X/ and |u(x)| < 5 for all x in the unit ball of Xo, then 
|0(r*) | < a/2. Applying Lemma 5.4.12, we see that there exists z £ X 0 such that 
||z|| = 1 and \u(z)\ < a for all u £ Af\. For each u £ XJ“, since 

j-dpi («-*(«)«) eM, 


we therefore have 

1+ Ii^n l(« - <t>{u)u 0 ) (z) I < a. 

By Proposition 5.3.1, there exists U\ £ X/ such that Ui(z) > 1/2. Thus 

\ < IK - <t>{ui)uo) {z)\ + |</>(«i)| |«oK| < (1 + H\\)a+ ||(/>|| MK . 
and therefore 

l ” o(z)l> HiG- (I + w) “) > 3H 

provided we choose a small enough. In that case, writing 

1 

Mz) z ' 


X = 
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we see that ||x|| < 3 ||<^||. Also, for each u G X*, 

I u(x) - <j>(u) | = |^|yy| I (U - <j){u)u 0 ) (z) \ < 3 ||0|| (1 + ||</>||) a, 
which can be made less than £ by a suitably small choice of a. □ 


Theorem 5.4.14. Let X be a separable Banach space, and <fi a linear functional 
on X* that is weak* -uniformly continuous on X£. Then there exists x € X such 
that 4>(u) = u(x) for each u G X* . 


Proof. We may assume that ||</>|| < 1. Recursively applying Proposition 5.4.13, 
construct a sequence {x n ) n>1 of vectors in X such that for each n, 


n 

4>{u) - ^2 u( Xk) 
k=l 



(u G X*) 


(5.9) 


oo 

and \\x n || < 3/2 n_1 . The series x n then converges to an element x of the com- 

n= 1 

plete space X. Using the linearity and continuity of u, and letting n — > oo in (5.9), 
we obtain the desired conclusion. □ 


Let H be a nontrivial Hilbert space. One of the topologies on B(H) that is 
important in operator-algebra theory is the weak- operator topology r w : that is, the 
locally convex topology defined by the seminorms of the form T \(Tx,y}\ with 
x,y in H. Classically- but not constructively (see Exercise 17) — the sets of the 
type 

n 

T G B(H) : ^2 I {Te uej )\<5 

i,j = 1 

with 5 > 0 and {ei, . . . ,e n } a set of pairwise orthogonal unit vectors in H , form 
a base of weak-operator neighbourhoods of 0 in B(H), so a linear functional 4> on 
B{H) is r U! -continuous if and only if it has the following special continuity property: 



SC There exist 5 > 0 and a set {ei, . . . , e n j of pairwise orthogonal unit vector s 

n 

in H such that for each T G B(H), if |(Tej,ej)| < 5, then \4>(T)\ < 1. 

i,j — i 

We shall use the technique embodied in the proofs of Lemma 5.4.11, Lemma 5.4.12, 
and Proposition 5.4.13 to produce a characterisation of those linear functionals (f) 
on B(H) with the property SC. 
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Before the next result, we mention that, in spite of what we wrote in the pre- 
ceding paragraph, it is constructively provable that the sets of the form 


T € Bi{H) : ]T \{Te u e 0 )\<5 

i,3 — 1 

with S > 0 and {ei, . . . , e n } a set of pairwise orthogonal unit vectors in H , form a 
base of weak-operator neighbourhoods of 0 in the unit ball B\{H). This is left as 
part of Exercise 17. 

Proposition 5.4.15. If H is a nontrivial Hilbert space, then Bi(H) is T w -totally 
bounded. 

Proof. Let {ei, . . . , e n } be a finite set of pairwise orthogonal unit vectors generating 
a finite-dimensional subspace Hq of H. In view of the observation preceding this 
proposition, it will suffice to prove that Bi(H) is totally bounded with respect to the 

n 

seminorm pj k : T \('^' e j- e k) I ■ Let P be the projection of H on H 0 . Note that 

j,k = l 

B(Hq) is a finite-dimensional Banach space, and hence has a totally bounded unit 
ball, relative to the operator norm. Let {T°, . . . , T^} be an e/n 2 -approximation to 
Bi(H 0 ), and consider any T € B\(H). The restriction ( PT) 0 of PT to H 0 belongs 
to Bi(H 0 ), so there exists i such that ||(PT) 0 — T°|| < e/n 2 . Also, TfP € B\(H). 
Thus if 1 ^ j, k ^ n, then 

| <(T - T°P) ej ,e k ) | = | < (T - 7;°) e 3 ,Pe k ) | = | (P (T - T?) e jt e k ) \ 

= | ( ((PP) 0 — Tf ) ej ,e k )\ ^ ||(PT) 0 -I?|| < 

n 

Hence |((P — TfP) e 3 , e k )\ < e. We now see that {T°P, . . . ,I^P} is an e- 

j,k—l 

approximation to B\{H) relative to the seminorm p 3k . □ 



Proposition 5.4.16. Let H be a nontrivial Hilbert, space, and let <p be a linear 
functional on B(H) with the property SC. Then for each £ > 0 there exist a fi- 
nite set {ei,...,e n } of pairwise orthogonal unit vectors in H , and elements Cj k 
(1 ^ j,k ^ n) of K, such that 


m E Cj k \h ej , e k ) 

j,k = 1 


< £ 


(T e Bi(H)) . 


Proof. With the proofs of Lemma 5.4.11, Lemma 5.4.12, and Proposition 5.4.13 at 
hand, we omit some of the grisly details of this one. We first note that, in view of the 
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observation immediately preceding Proposition 5.4.15, <p is r u ,-uiiiformly continuous 
on B\(H). It follows from this, Proposition 5.4.15, and Proposition 5.4.3 that 

11011 = sup{|^(T)| : T £ Bi(H)} 


exists. Scaling if necessary, we may assume that there exists T 0 £ B\(H) such that 
4>(Tq) = 1 . Choose a finite set {ei, . . . , e„} of pairwise orthogonal unit vectors in H, 
and a positive number S, such that if T £ B(H ) and . =1 \{Tei,ej)\ < 26, then 
\<f> (T) | < 1 . Let Hq = span {ei, . . . , e n } , and for each x = (xi, . . . , x n ) in the n-fold 
product space Hq, define the product norm 

n 

ll x ll = M • 

k = 1 

Note that for each T G B\(H) and each x G Hq , 

n n n 

^2(Tx k ,e k ) < 51 \( Tx k’ e k)\ < ^2 Ikfcll = ||x|| • (5.10) 

k= 1 fc=l k = 1 


For such x, the mapping 




^ ( Tx k ,e k ) 
k = 1 


is r u , -uniformly continuous on B\(H). Since (by Proposition 5.4.15 and Lemma 
5.4.9) the unit kernel Jf\ of <f> is -tot ally bounded, it follows from Proposition 
5.4.3 that we can define a seminorm on Hq by 


= sup ■ 


^2(Tx k , 


^k / 


k = 1 


T£ m 


Then || || 0 ^ || || , by (5.10), so the mapping || || 0 from (Hq, || ||) to R. is uniformly 
continuous. Since Hq is finite-dimensional, the set 


{x £ HZ : ||x|| = 1} 


is || 1 1 -compact. Hence 


(3 = inf { 1 1 x 1 1 0 :xefl 0 ", ||x|| =1} 

is well defined. This time (cf. Lemma 5.4.12) we prove that (3 = 0. Suppose that 
(3 > 0. Then (Hq, || || 0 ) is a finite-dimensional Banach space, and 

IMI° = sup{|u(x)| : x G HZ, ||x|| 0 < 1} 

defines the corresponding operator norm on the dual space (Hq)* . For each T £ Afi 
and each x £ H^ let 
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n 

F(T)(x) = £ (TXk-, &k) • 
fc= 1 

The mapping F : N\ — > ((Hq)* , || ||°) is r^-uniformly continuous, so F(M\) 
is totally bounded, and therefore located, in ((Hq)* , || ||°). Using the separation 
theorem as in the proof of Lemma 5.4.12, we can show that F(Afi) is dense in the 
unit ball Sq of (Hq)* , || ||°). Since the linear functional 

n 

\ X 'W E (f3T 0 x k ,e k ) 

fc= l 

belongs to Sq, given t > 0 we can find T G such that \\F — F(T)||° < n~ 2 St. 
Write 

{ {]■ k) : 1 < j, k < n} = P U Q, 

where P and Q are disjoint sets, 


(j> k) G P ==> (((3T 0 — T) ej,e k ) ^ 0, 

(j,k) G Q => \(((3T 0 — T) ej,e k )\ < n~ 2 5t. 

If (j, k) G P , set 

rjk = ((0T O - T) ej,e k )~ x \(((3T 0 - T)ej,e k ) | , 
and if (j, k) G Q , set rj k = 0. Now define £ € Hq by 


to obtain 


and hence 


Thus 


4 = n 2 ^2 r Jk e j (1 s$ k < n) , 
j = i 


ii^fcii < n 2 Em ^ n 1 

3 - 1 


ii€iio<ii€ii = Eii^ii< L 

k = 1 


J2((PT 0 ~T)y,e k ) 

k = 1 


< n 2 6t. 


(5.11) 


Moreover 
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n n n 

n 2 ^((/3T 0 -T)e fe ,e fc ) = <(/JT„ - T ) e j, e k) 

k— 1 k— 1 j = l 

Y r jk((PT 0 -T)e j ,e k ) 

ti,k)£P 

= J2 \((PT 0 -T) ej ,e k )\, 

(. i,k)eP 

SO 

n 

£ |<(/jr 0 -T)e J -,e fc >|< Y \((PT 0 -T)e j ,e k }\+ Y \{Wo - T) ej ,e k )\ 

3,k= 1 (j,k)£P (j,fc)eQ 

n 

^ n 2 ^ ((/?T 0 - T) £ fc , e fe ) + n 2 (n~ 2 St) ^ 2<ft, 

fc=i 

the last step following from (5.11). Hence 

n 

Y Y ((f3T 0 -T) ej ,e k }\ <26 

j,k = 1 

and therefore |<^(/3To — T)| < t. We now have 

/3 = /ty(T 0 ) - 0(T) = 0(/3 T 0 —T)<t. 

Since t > 0 is arbitrary, it follows that (3 = 0. 

Thus for each a > 0 there exists z £ Hq such that |jz|j = 1 and 

n 

Y(Tz k ,e k ) < (2?r) _1 a (T e M) ■ 

fc=l 

We can arrange that each z k yf 0, provided we replace (2 ?r) _1 by n _1 in the last 
inequality; we assume that this has been done. Therefore 

n 

Y((T-<t>(T)T 0 )z k ,e k ) <n" 1 ( 1 + M)a (T £ B\(H)) . 

k = 1 


We now introduce an operator Ti that will enable us to bound (T^Zk, e^) away 

k = 1 

from 0. Setting 


Tix = n 1 H^fcll 1 {x,z k )e k (x £ H) 


we have 
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||Tia;||<n 1 ^||z fc || 1 INI INII = 11*11 » 

k = 1 

so T\ G Also, 

n n n 

Y (Tiz k ,e k ) = n~ l Y Ikfcll " 1 {zk,z k ) = ^ _1 ^ 


Zfcll = n x . 


/c— 1 


fc=l 


fc=l 


Hence 


n 1 < 

n 

Y{{Ti-‘KTi)T 0 )z k ,e k ) 

+ \m)\ 

n 

^ ^ (T 0 z k , e k ) 


k = 1 


k = 1 


^ n 1 (1 + ||0||)a- 


^ ^ (^0 Z k f Cfc) 


fc=l 


and therefore 


y, {T 0 z k ,e k ) 


k = 1 


> (nll^ll)- 1 (1-(1 + W) «) >(2n||^||)- 1 


provided a is small enough. Setting 

x= ("^(To^fc.efc)^ z, 

Vfc 

for each T G Si (if) we now obtain 


\k= 1 


m-Y {TXk, €-k) 


Y (<l > ( T ) T 0 Z k, e k) - Y ^ TZk ' ek > 


Y( T 0 Z k, e k) 

k=l I fc= 1 I | fc=l fc=l 

< 2?r||0||?r' 1 (l + ||<^||)a 

= 2W(1+M)a, 

which can be made less than e by choosing a suitably small. It remains to take 
Cj k = (%k, e j) for 1 < j, k < n. □ 


Exercises 

1. Let U, V be subsets of a Banach space X such that U U V is dense. Prove the 
following: 

(a) If u € U and v £ V, then for each £ > 0 there exist t £ [0,1] and x £ U fl V 
such that ||ar — tu — (1 — t)v\\ < e. 
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(b) For each x G X , 


P (x, U l~l V) = min {p(x, U),p(x, V)} , 

in the sense that each side of this equation exists if and only if the other 
does. 

2. We say that a subset 5 of a normed space X has the boundary crossing property 
if for each £ G S, each z € ~S, and each e > 0 there exist t G [0, 1] and x G dS 
such that \\x — t£ — (1 — t) z\\ < e. Prove that if X is complete and S U ~S is 
dense, then S has the boundary crossing property. Do we necessarily obtain 
this conclusion if S U ~ S is not known to be dense in XI 

3. Let S' be a subset of a Banach space such that SU~S is dense, and let Xq G ~S. 
Prove that p(xo,dS) exists if and only if p(xo,S) exists, in which case these 
two distances are equal. 

4. Let {ei, . . . , e„} be the canonical orthonormal basis of the Euclidean space R", 
and E the simplex whose vertices are ±ej (1 ^ j ^ n). Let £ belong to the 
metric complement —E of E in R". Prove that the segment [0, £] contains 
points arbitrarily close to the union of the faces of E. 

5. Show that if the Hahn-Banach theorem holds with exact preservation of the 
norm of the functional, then LLPO holds. 

6. Let Y be a finite-dimensional subspace of a normed space X , and Xq G X. 
Suppose that p(xo,Y) > 0 and that X is the span of Y U {a’o} . Let u be a 
nonzero normed linear functional on Y. and £ > 0. Without using the separation 
theorem, the Hahn-Banach theorem, or any of their consequences, prove that 
there exists a normed linear functional u on X such that ||it|| ^ ||u|| + £ and 
u(y) = v(y) for each y G Y. Use this to give another proof of the Hahn-Banach 
theorem. 

7. Use the Hahn-Banach theorem to prove Proposition 5.3.1. 

8. Let p be an integer ^ 2. Prove that the Banach space l p is uniformly convex 
and has Gateaux differentiable norm. (See Exercise 11 of Chapter 4.) 

9. Let X be a normed space with Gateaux differentiable norm. Let C be a closed, 
located, convex subset of X, let x G — G, and let y G C. Prove that ||x — y\\ = 
p (x, C) if and only if Re u x _ y (z — y) ^ 0 for all z G C. 

10. Let X be a uniformly convex Banach space with Gateaux differentiable norm. 
Let A and B be subsets of X whose algebraic difference is located and convex, 
and whose mutual distance is positive. Prove that there exists a normed linear 
functional u on X 1 with norm 1, such that Re u (y) ^ Re u ( x ) + d for all x G A 
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and y G B. (The results embodied in Exercises 9 and 10 are due to Ishihara 
[55].) 

11. Prove that if, under the other hypotheses of Lemma 5.4.9, the conclusion holds 
without the hypothesis that (f> is either 0 or nonzero, then we can derive the 
essentially nonconstructive proposition 

K (x = 0 V -> (x = 0)) . 

12. Let / be a uniformly continuous linear functional on X, and S a balanced, 
totally bounded subset of X. Prove that the set 

S t = {x G S : |/0r) | < t} 
is totally bounded for each t > 0. 

13. Let (X,p) be a seminormed space, and S a balanced, totally bounded subset 
of X. Let the mapping / : X — > IK be both uniformly continuous on S and 
homogeneous — that is, / (Ax) = A f(x) for all A € IK and x € X. Prove that the 
set 

S t = {x G S : |/ (x) | < t} 
is totally bounded for each t > 0. 

14. Let X be a finite-dimensional locally convex space, and let S be the set of 
convex combinations of points in an inhabited finitely enumerable subset of X. 
Prove that S is totally bounded in X. 

15. Fill in the missing details in the proof of Proposition 5.4.16. 

16. Let (e„) n>1 be an orthonormal basis of a separable Hilbert space H, and let 

0 be a linear functional on B(H) that is weak-operator uniformly continuous 
on Prove that there exist complex numbers Cj k ( j,k ^ 1) such that 

OO 

0(T) = E c jk {T ej ,e k } for all T G B(H). 
j,k=l 

17. Prove the following for a nontrivial Hilbert space H: 

(a) The sets of the form 


T&B^H) : £ |(Te;,ej)| < <5 ^ , 

*.j= i 


with 5 > 0 and {ei, . . . , e n } a set of pairwise orthogonal unit vectors in H 1 
form a base of weak-operator neighbourhoods of 0 in the unit ball B\(H). 
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(b) If B\{H) can be replaced by B(H ) in (a), then LPO holds. 

18. Let C be a convex subset of C” such that sup {f(z) : z £ C} exists for each real 
(bounded) linear functional / on C™. Prove that C is totally bounded. (Hint: 
Prove that C is bounded; that 

d(x) = sup | inf Re /(a; — y) : / € S* 

exists for each x £ C n , where S* is the unit ball of the dual of C n ; and that 
p(x,C) exists and equals d(x).) 

19. Let X,Y be normed spaces. We say that (f> is ultraweakly continuous if it is 
uniformly continuous on B\(X,Y) relative to the locally convex topology t w 
defined on B(X, Y) by the seminorms T y*(Tx) with x £ X and y* a normed 
linear functional on Y. Prove that the following conditions are equivalent: 

(a) B\ (X,Y) is r^-totally bounded. 

(b) Every ultraweakly continuous linear functional on B(X,Y) is normed. 

(c) Every ultraweakly continuous linear functional on B (X,Y) is compact. 


Notes 

A classical method of establishing the continuity of the boundary crossing map in 
Proposition 5.1.5 is to use a contradiction argument to prove that the mapping is 
sequentially continuous on — C; see [87] (pages 271-272). Our argument, based on 
the simple Lemma 5.1.3, is much more transparent and shorter than either that 
classical argument or its constructive counterpart (which is given in [32] and proves 
only the sequential continuity, not the full continuity, of the boundary crossing 
map). More information about boundary crossing can be found in [36], from which 
Exercises 2-4 are taken. 

The Hahn-Banach theorem can be proved without recourse to the separation 
theorem (Exercise 6). Using the Church-Markov- Turing thesis, Metakides et al. [71] 
have produced an explicit example in which the hypotheses of the Hahn-Banach 
theorem hold but the linear functional cannot be extended with exact preservation 
of its norm. This makes Theorem 5.3.7 (which, together with Exercises 9 and 10, 
is due to Ishihara [55]) all the more significant, especially as it applies to many 
important examples of normed spaces, such as the L p spaces for p > 1. 

The locally convex topology on a finite-dimensional space X is unique and 
equivalent to the topology induced by any norm on X. The proof of this depends 
on some intricate geometric algorithms and is found in [39] . 
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Finding necessary and sufficient conditions for the existence of the norm of a 
bounded linear mapping T : X — > Y between normed spaces, other than the case 
Y = K, is an important unsolved problem, even in the case where T is an operator 
on a Hilbert space. 

The Banach- Alaoglu theorem (Theorem 5.4.7) without the hypothesis of sepa- 
rability is classically equivalent to the ultrafilter principle — every filter is contained 
in an ultrafilter (see [80], page 766) — and is therefore unlikely to be provable con- 
structively; see Exercise 1.4 of Chapter 1. 

The introduction of the numbers rjk in the proof of Proposition 5.4.16 is occa- 
sioned by the problem with the modulus-argument decomposition discussed at the 
start of Chapter 2. 

Proposition 5.4.16 is a step in the direction of a constructive proof of the classi- 
cal characterisation of weak-operator continuous linear functionals as those of the 

n 

form T Y ( Txk,yk ) with the vectors Xk, yk hr H. All attempts to generalise the 
fc= l 

technique used to prove Proposition 5.4.13 in order to characterise ultraweakly con- 
tinuous linear functionals — those linear functionals on B(H) that are weak-operator 
uniformly continuous on B\{H ) — suffer from a curse of dimensionality, failing be- 

n 

cause the n appearing in expressions like Y ( Txk,yk ) is usually more than 1. 

fc= l 

Although Exercise 16 requires only the weak-operator uniform continuity of the 
functional on the unit ball of B(H) and derives a stronger conclusion than that of 
Proposition 5.4.16, the solution that we have in mind for that exercise works only 
for a separable Hilbert space. In that case we can improve the conclusion to find 
square-summable sequences {x n ) n>1 and (y n ) n > i hr H such that for each T € B(H) 

OO 

we have (j>(T) = Y { T x n ,y n )\ see [23]. 

71=1 

If H is a Hilbert space, then Bi(H) is classically weak-operator compact; how- 
ever, the completeness of Bi(H) relative to the uniform structure associated with 
the weak-operator topology is an essentially nonconstructive property; see [14]. 

The result in Exercise 18 first appeared in [56]. 
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Operators and Locatedness 


Location! Location! Location! 


— Unknown Estate Agent 


We begin by introducing normed spaces on which the norm is differentiable in some 
fashion. In Section 2, with substantial help from the A-technique, we provide criteria for the 
locatedness of certain convex sets in a normed space. This work is applied in Section 3 to 
proving that a bounded operator on a Hilbert space H has an adjoint if and only if it maps 
the unit ball to a located set. In the next section we construct approximate eigenvectors of 
a selfadjoint operator on H, and then show that a bounded positive operator has a unique 
positive square root. This result is applied in Section 5, in which we make further good 
use of the A-technique to show that, for a so-called weak- sequentially open operator T on 
H, the range of T is located if and only if the range of its adjoint is located. The section 
ends with a proof of the closed range theorem for operators with an adjoint. The final 
section of the chapter presents a version of Baire’s theorem, which is then applied to the 
proofs of three of the pillars of functional analysis: the open mapping, inverse mapping, 
and closed graph theorems. 


6.1 Smooth and Uniformly Smooth Normed Spaces 

As the reader will have observed, locatedness, classically a nugatory concept, plays 
an important role in many aspects of constructive analysis, such as the Hahn- 
Banach theorem and the separation theorem discussed in the preceding chapter. It 
is therefore fitting that criteria for locatedness should be a primary theme of this 
final chapter. Our first aim is to find necessary and sufficient conditions for the 
locatedness of convex sets in a normed space; this requires some preliminary results 
associated with the differentiability of the norm. Later in the chapter we shall apply 
our conditions to the locatedness of subsets associated with operators on a Hilbert 
space. 
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Throughout this section, unless we state otherwise, X will be a normed linear 
space. We say that X is smooth if its norm is Gateaux differentiable at each nonzero 
vector; this is the case if and only if the limit 


u x (y) = Jim 


\\x + ty\\ - ||s|| 
t 


(6.1) 


exists for all unit vectors x,y £ X. We say X has Frechet differentiable norm if this 
limit is uniform in the unit vector y: that is, if for each unit vector x £ X and each 
e > 0 there exists 5 > 0 such that 


\\x + ty\\ - ||x|| 
t 


u x {y) 


< £ 


( 6 . 2 ) 


whenever y £ X, ||y|| = 1, and 0 < |t| < S. If, moreover, 6 can be chosen inde- 
pendent of x, then we say that X has uniformly Frechet differentiable norm and 
is uniformly smooth. For this to be the case, for each e > 0 there must exist S > 0 
such that (6.2) holds whenever x,y £ X, ||a:|| = ||y|| = 1, and 0 < \t\ < 6. 

With each vector x in a smooth normed space X we associate the mapping 
J x : X — ► R defined by 


J x (y) 


lim lladl 
t-> o 


\\x + ty\\ - INI 

t 


If x = 0, then J x (y) = 0; if x 0, then J x (y) = ||a:|| u x (y). In both these cases, 
\J x (y)\ ^ ||x|| ||y|| . In order to show that J x (y) is defined for an arbitrary element 
x of X, construct an increasing binary sequence (A„) n>1 such that 

A„ = 0 => || x || < -, 
n 

An = 1 => Ikll > 

We may assume that Ai = 0. If A n = 0, set = 0; if A n = 1, set ( k = ||x|| u x (y) for 
all k ^ n. Then |^ m — C„| ^ II 2/11 / n whenever m ^ n > 1; so (C n ) ra >i a Cauchy 
sequence and therefore converges to a limit ( £ R. Note that |£| < ||x|| ||j/|| . For 
each £ > 0, either ||a:j| ||j/|| < e/2 or ||a;|| j|y|| > 0. In the first case we have 


x || \\x + ty\\ - ||x|| 


^ 2 | 


112/11 


< £ 


for all t ^ 0. In the second case, ( = J x (y) and 



c 


< £ 


for all sufficiently small t / 0. Since £ > 0 is arbitrary, we conclude that J x (y) 
exists and equals 
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Proposition 6.1.1. If X is smooth, then for each x in X, J x is a normed linear 
functional on X, ||J X || = ||x||, and J x {x) = ||x|| . Moreover, if c > 0 and x £ X, 
then J q X — e ' l j • 

Proof. If x 0, then the first three conclusions follow from Proposition 5.3.6. On 
the other hand, if also c > 0, then 

J cx — || X-X 1 1 U cx — C 1 1 X 1 1 U x — cJ x . 

The extension to the general case is left to the reader. □ 


Lemma 6.1.2. If X is smooth, then for all x, y £ X, 

\\yf < INI 2 -2 J v (x-y) . 

Proof. Using the linearity of J y and also Proposition 6.1.1, we compute 
Jy {x~y) = J v {x) - || yf < || y|| ||z|| - ||t/|| 2 • 

Hence 

irf - \\yf 2 J y (x-y)> ||z|| 2 + \\y\\ 2 - 2 ||z|| ||y|| = (||x|| - \\y\\) 2 > 0. 


□ 


Lemma 6.1.3. Let X be a smooth normed space, and x a unit vector in X. Let 
S,e be positive numbers such that if ||t/|| = 1 and 0 < |t| < 6 , then 


\x + ty || - ||x|| 


t 


Jx{y) 


< £. 


Then for such t and each unit vector y, 

||x + ty\\ + ||x - ty|| < 2 + 2 te. 

Proof. We may assume that 0 <t < 6. If ||y|| = 1, then 

\\x + ty\\ - ||x|| 


t 


and 

Hence 

and 


‘ — * 2/11 - 


-t 


- Jx(y) < e 


- J x (y) > -e- 


■ + ty || < 1 + tJ x (y) + t£ 


\x - ty || < 1 - tJ x (y) + t£, 


from which the result follows. 


□ 
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We store the following result for use in Section 2. 

Proposition 6.1.4. Let X be a smooth, uniformly convex Banach space, and let f 
be a normed linear functional on X. Then there exists a unique vector x € X such 
that f = J x . 

Proof. First assume that / is nonzero. By Proposition 2.3.7, there exists a unit 
vector y € X such that f(y) = ||/||. Applying Proposition 5.3.6 to ||/|| -1 /, we see 
that / = ||/|| u.y. Now let x = ||/|| y. Then ||x|| = ||/|| and (since u cy = u y for all 
c > 0) 

/ = ll/ll u v = Ml W||/||j, = INI u x = J x . 

It remains to deal with the uniqueness. Let / = J z for some z € X. Then, by 
Proposition 6.1.1, ||z|| = || J z \\ = ||/|| > 0, so 

\\f\\Uy = f= ||/|| U z = ||/|| «(| Z ||-1 2 

and therefore U\\ z n-i z = u y . Since ||z|| 1 z is a unit vector, the uniqueness part of 
Proposition 5.3.6 shows that ||z|| 1 z = y\ whence 

2 = \\z\\y= \\f\\y = x. 

This completes the proof in the case where / is nonzero; the general case is left as 
an exercise. □ 


The next lemma simplifies the proof of the theorem following it. 

Lemma 6.1.5. Let f be a mapping o/R. — {0} into R such that 

(a) if 0 < t' ^ t, then f{—t) ^ /(— t') ^ /(£') ^ /(£), and 

(b) for each e > 0 there exists t > 0 such that f(t ) — /(— t) < e. 

Then lim t ^o /(£) exists. 

Proof. Construct a sequence {t n ) n>1 of positive numbers decreasing strictly to 0 
such that /(£„) — f(—t n ) < 1/n for each n. In view of (a), for m^nwe have 

0 < f(t m ) - / (t n ) ^ f(t m ) - f(-t m ) < — . 

m 

Hence (/(£«))„> i is a Cauchy sequence in M and so converges to a limit l € M such 
that 0 ^ f(t m ) ^ ^ 1 /m for each m. Given e > 0, choose t > 0 as in (b) and 

then N such that tjsr < t. and 0 ^ /(In) — l < e. By (a), if 0 < |t| < £at, then 
f(~t N ) < fit) < f(t N ) and therefore | /(£) - Z| < f(t N ) - l < e. □ 
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We have already defined “uniformly convex” for normed spaces, in Chapter 2. 
To introduce this notion for the dual of a normed space X, we concentrate on those 
elements of X* that have a norm. We say that X* is uniformly convex if for each 
£ > 0 there exists 8 with 0 < 8 < 1 such that for all normed u,v £ X* with 
IMI = IMI = ^ there exists a unit vector (el such that \ (u + v) (£) > 1 — 8 , 

then (u — v) (x) < £ for all unit vectors x. 

Proposition 6.1.6. A separable normed space is uniformly smooth if and only if 
it has a uniformly convex dual. 


Proof. Let X be a separable normed space, and suppose to begin with that X* is 
uniformly convex. Given unit vectors x, y £ X 1 for all nonzero t £ R. define 

= \\x + ty\\ - |[x|| 

Then / satisfies (a) of Lemma 6.1.5. It follows that in order to prove that X is 
uniformly smooth, we need only show that for each £ > 0 there exists t > 0, 
independent of the unit vectors x and y, such that f(t) — f{—t) < e. To this end, 
let 8 > 0 be as in the definition of “uniformly convex” above, fix t > 0 such that 

8 f(2 + t) <7 5, 


and consider any unit vectors x, y £ X. Note that ||x + ty || >1/2 and ||x — ty\\ > 
1/2. Defining unit vectors £,77 by 


£ 


1 

\\x + ty\\ 


( x + ty ), 


tl = 


f(* 


ty), 


we obtain 

U-V\\ = I, , . imi 7 [7 \\\\x-ty\\ {x + ty) - 1 1 x + ty\\ (x - ty) || 

\\x + ty\\ ||x — ty\\ 

<411(11^-^11 - \\x + ty\\) x + t (\\x - ty\\ + ||x + ty||)y|| 

< 4 [|||* — ty\\ - \\x + ty\\\ +t (||x - ty\\ + ||x + ft/||)] 

< 4 (2 1 + 2t(l + t)) = 8t{2 + t) <8. 


Now let 

. f te 

1 \ 4(1 + t) 2 f 

and use Proposition 5.3.1 to construct normed linear functionals u and v such that 

IMI = IMI = 1) u {0 >1—7: an d v (v) >1 — 7- Then 


(u + v) (0 


+ v{ri) + v - 77) > 2 - 27 - ||£ - 77H ^ 2 



-8 = 2-28 


and therefore \ (u + v)(£) > 1 — 8. It follows that (u — v) (z) < e/2 for all unit 
vectors z £ X, and in particular that (u — v) (y) < e/2. Now, 
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u(x + ty) = (I® + ty || u( 0 > ||® + ty\\ (1 - 7) , 


SO 


ts 

II® + ty | < u(x + ty) + ||® + ty || 7 ^ it(® + ty) + (1 + t ) 7 ^ u(® + ty) + — . 


Likewise, 


ts 

\x~ty\\ < v(x — ty) + — . 


Thus 


I* + ty\\ - W ||x - « - 114 = i (||I + ^ + ||x _ m _ 2) 


~t 


te 


^ - [u(x + ty) + v(x - ty) + — - 2 


te 


= - + v(x) + t{u - v)(y) + — - 2 

< ^ + (« - u) (y) < £• 

This completes the proof that X is uniformly smooth. 

Now suppose, conversely, that X is uniformly smooth. Then for each e > 0 there 
exists 5 such that 

0 < S < min • 

and such that for all unit vectors ®, y € X and all t with 0 < |f| < 45/e, 

£ 




t 


Jx{y) 


< 


Letting t = 25/e, we see from Lemma 6.1.3 that for all unit vectors x,y £ X, 

II® + ty\\ + ||® - ty\\ < 2 + y. 

Now consider normed u, v G X* with ||u|| = ||®|| = 1, and assume that ^(■u+®)(^) > 
1 — 5 for some unit vector £ € X. Then for all unit vectors i£lwe have 

2-y = 2 -5< (u + u)(0 

= it(£ + tx) + ®(£ — tx) — t(u — v)(x) 

||C + tx || + lie - t®|| - t(u - v)(x) 

< 2 + y — t(u — v)(x) 


and therefore (u — t;)(®) < £• Thus X* is uniformly convex. 


□ 
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If H is a Hilbert space, then, in view of the Riesz representation theorem, its 
dual is, like H itself, uniformly convex. It follows from Proposition 6.1.6 that a 
separable Hilbert space is uniformly smooth. In fact we can remove separability 
and completeness here. 

Proposition 6.1.7. An inner product space is uniformly smooth. 


Proof. Given unit vectors x, y in the inner product space X , for real t we have 


and therefore 


\x-ty || 2 - j|x|| 2 = -2tRe(x,y) +t 2 
— 2tRe (x,y) + t 2 


x - ty - x = 


-ty II 


It follows that if 0 < \t\ < 1/2, then 


\x-ty\\ - ||x|| 


t. 


+ Re (x, y) 


—2 Re (x, y) + t 


\\x-ty\\ + ||x| 

1(1 

x-ty || + 1 IU 


+ Re (x, y) 


x-ty || - 1) Re (x,y) +t\ 


^ 2 f(Re(x,y) + l)^ 4 ^, 


which tends to 0 with t, independently of x and y. 


□ 


6.2 Locatedness of Convex Sets 

Our aim in this section is to give conditions ensuring the locatedness of a bounded 
convex subset of a normed space under certain strong hypotheses that apply, in par- 
ticular, to the norm on a Hilbert space or an Lp-space 1 for 1 < p < oo. Specifically, 
we aim to prove the following two results and some of their consequences. 

Theorem 6.2.1. Let X be a uniformly smooth normed space over R, and C an 
inhabited, bounded, located convex subset of X. Then 

sup {J x (y) •■y&C} 


exists for each x £ X. 


1 We refer the reader to [9, 12] for the theory of L p spaces. 
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Theorem 6.2.2. Let C be an inhabited, bounded, convex subset of a uniformly 
smooth normed space X over R, such that 

sup {J x (y) -y eC} 

exists for each x £ X. Then C is located. 


The path to Theorem 6.2.1 takes us through a tangle of technical lemmas, the 
first of which is an expression of the continuity of J x as a function of x. 

Lemma 6.2.3. Let X be a real normed space with a Frechet differentiable norm. 
For each unit vector x € X and each e > 0 there exists S > 0 such that if ||y|| < 6, 
then \.J x (z) — J x + y (z) \ < £ for all unit vectors z £ X. 


Proof. Given a unit vector x £ X and £ > 0, compute 7 £ (0, 1) such that if ||z|| = 1 
and 0 < |t| < 7, then 


\\x + tz\\ - INI 
t 


Jxi.z) 



(6.3) 


Let 


c • .V 1 
0 = mm \ —— , — 
32’ 2 


and consider a vector y £ X with \\y\\ <6. Noting that ||ar + y\\ > 1/2, set 

1 


£ = 


y II 


(x + y). 


Then since 

|i-lk + y||| = IW-ll* + »lll<lli/ll, 

we have 

II® + 2/11 ll£-®ll = \\x + y~ ||® + y\\ ®|| = ||(1- II® + 2/11)® + 2/II < 2||y|| . 
Also, by Proposition 6.1.1, •/{(£) = ||£|| 2 = 1; so 


0 < 1 - M x ) = Jt(Z - ®) < U - ®ll < < 4 <*- (6- 4 ) 

On the other hand, setting t = 7/2 and applying Lemma 6.1.3, we see that for each 
unit vector z £ X, 

||® + tz\\ + ||® - tz\\ < 2 + 

Therefore, by (6.4), the choice of t and <5, and Proposition 6.1.1, 
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1 - ^ < 1 - 4 6 
4 

< M x ) 

= J x {x) + J(.(x) - 1 
= J x (x + tz) + J^(x — tz) — 1 — t( J x — 

< ||ar + tz\\ + ||ar - tz\\ - 1 - t(J x - J^)(z) 

< — + 1 - t(J x - Jc){z). 

Hence (J x — J^)(z) < e/2. Replacing z by —z, we obtain — (J x — J^)(z) < e/2 and 
therefore | (J x — J^)(z)| < e/2. It follows from this and Proposition 6.1.1 that 

I (Jx - Jx+ y ){z) | = |||a: + y\\ (■ J x ~ Jt)(z) + (||x|| - ||x + y||) J x {z)\ 

< Ik + J/ll K’/x - Jz)(z)\ + |||z|| - \\x + y\\\\.J x (z)\ 

< | l(J*-^)WI + lli/ll INI INI 


as we wanted. 


□ 


Lemma 6.2.4. Let X be a uniformly smooth normed space over R, and let C be 
an inhabited, bounded, located convex subset of X. Let xo G X and e > 0. Then 
there exists r > 0 such that for all y G C, 

> either there exists x G C such that J Xo fy) + re/ 6 < J Xo ( x ) 

> or else J Xo {z) < J XQ (y) + e for all z G C. 

Proof. Choose M > 0 such that ||x — y|| < M for all x, y G C. If M ||a;o|| < e, then 
for all x,y G C, 

J Xo ( x - y) < INo|| \\x - y\\ < M ||a’ 0 || < e. 

Hence we may assume that M ||a;o|| >0. From the definition of “uniformly smooth” 
and Lemma 6.2.3 we see that there exists S > 0 such that for all unit vectors 
x,y G X, 

• if 0 < |£| < 5, then 


\\ x + ty\\ - INI 

t. 


Jx{y) 


< 


2e 


3|M M' 


and 
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if ||y|| < S, then for all unit vectors z £ X, 


| Jxi.Z) Jx+y(z) | <C 


2e 

3||*o|| M' 


Let 


r = min < 1 


$ ||x 0 | 
M 


and fix y in C. Either p(x 0 + y,C) < ||rco || 2 — re/3 or 

ll*o|| 2 - < p(x 0 + y,C). (6.5) 

In the first case, there exists x £ C such that 

||*o + y-z|| 2 <IM 2 -y. 

It follows from Lemma 6.1.2 that 

||*o|| 2 < ||*o + V - x\\ 2 - 2 J Xo (y - x) < ||xoj| 2 - y - 2 J X0 (y) + 2 J Xo (x) 


and therefore 


TS 

Jx 0 (y) d < Jx 0 ( x )- 


( 6 . 6 ) 


Thus it remains for us to show that if (6.5) holds, and therefore 


Ml 2 - < 11*0 + V - 2|| 2 for all z G C, 


(6.7) 


then we can derive the second conclusion of our lemma. Consider any x £ C. If 
||*o|| \\y - *11 < £, then 


Jx „(*) - J X0 {y ) = J x 0 (* -y)^ ||*o|| II y - *11 < £• 

Thus without loss of generality we may assume that ||£Uo II || 2/ — *|| > 0- Since 
(1 — t) y + tx belongs to the convex set C, we see from (6.7) and Proposition 
6.1.1 that 

-y 2 < ^ X ° + T ^ - “OH 2 ~ IMI 2 

JxQ+Tpy—x) (*o d - r(y *)) ||x 0 || 

Jxo+T^y— x) (*o) d - xJ Xo ^. T ^y— x ^y x) ||*0 || 

< 11*0 + r(y - x)|| ||x 0 || - ||x 0 || 2 + Tj Xo+r{y _ x) (y - x). 


Hence 


0 < ||x 0 | 


l*o + x(y — x)|| - ||x 0 | 


2e 

d" Jxo+T(y—x){y ~ *) d g"- (6-8) 
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The appearance here of the expression 

\\x 0 + r(y - x)\\ - 


E = 


in the presence of a uniformly Frechet differentiable norm suggests that we consider 
the unit vectors 

1 1 , ^ 

U= li II X 0, V =T\ n(y~ x )- 

Foil Il2/-a:|l 


Writing 


we have 


t = 


II v- 


x 


1 


IIj/ _ x\ 

Since 0 < t C 5, it follows that 


;E = 


11*011 

|u + fo|| - ||u|| 
t 


2s 


3||*o|| M 


> 


1 


\\y-x\ 

l 


-E - J u (v) 


-E - 


1 


l|y — *11 Ikoll \\y~x\ 


r Jx 0 (y — x) 


l 


ll^oll l|y — *11 


\x 0 \\ E - J X0 (y - x)\ 


and therefore 




te\\y-x\\ 2s 
3 M 3 ' 


In view of (6.8) and (6.9), it now makes sense to examine 

| Jxo ( y x) — Jxo+r(y—x) {y *) | , 

which, by Proposition 6.1.1, equals 

INI h - z|| I J u (v) - J u +tv(v ) | • 

By our choice of S, this last expression is less than 

2s 


INI \\y- x \\ 


3 ||*o II M’ 


SO 


2s 

\j*o(y ~~ x) — J Xo -f- T (y— x )(y — x) | < — . 


(6.9) 


(6.10) 


It follows from (6.8)-(6.10) that 
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n / IU II \\ x 0 + T (V ~ x)\\ ~ Hzoll , x 2£ 

0 < \\x 0 \\ ^ Jxq +T(y—x)\y X)-\- ^ 





2e 

T’ 


from which we obtain the inequality J Xo {x ) < J Xo (y) + £• Since x was an arbitrary 
point of C, the proof is complete. □ 


Using Lemma 6.2.4 and the A-technique, we can now prove Theorem 6.2.1. 

Proof. Under the hypotheses of Theorem 6.2.1, fix M > 0 such that ||a: — y\\ ^ M 
for all x, y £ C, and consider any element x of X . It is enough to prove that for 
each e > 0 there exists y £ C such that 

J x (z) < J x (y) + e ( z£C ). (6.11) 

For in that case, if 0 < a < f3 and we choose y £ C such that (6.11) holds with 
£ = (j3 — a) /2, then either J x {y) > a, or else J x {y) < |(a + /?) and therefore 
■J x (z) < (3 for all z £ C; so the desired supremum exists by the least-upper-bound 
principle. 

Fixing x € X, y 0 £ C, and e > 0, let r be as in Lemma 6.2.4. We construct an 
increasing binary sequence (X n )n^o with Aq = 0, and a sequence (y n )n^o i n C, such 
that 

T£ 

A n — 0 ^ J X (yn— l) "H ^ < J X (yn)> 

A„ = 1 => J x (z) < J x {y n - i) + £ for all z £ C. 

To do so, we assume that we have constructed Ao, . . . , A n and yo, . . . , y n . If A n = 1, 
set A„+i = 1 and y n + i = y n ■ If A„ = 0, then by Lemma 6.2.4, either J x (y n )+T£ / 6 < 
Jx(y') for some y' € C, or else J x (z) < J x (y n ) + £ for all z £ C. In the first case 
set A„+i = 0 and y n +i = y ’ . In the second case set A„+i = 1 and y n +i = Vn- This 
completes the inductive construction. 

Now choose N such that 

M\\x\\<N T -£. 

If Ajv = 0, then 

Jx{yo) “i - N Jx(vn) 

6 

and therefore 

W-7T < J x (Vn ~ Vo) < M ||a;|| , 

6 

a contradiction. Hence, in fact, Ajv = 1. □ 
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Corollary 6.2.5. Let X be a uniformly convex, uniformly smooth Banach space 
over ]R, and let C be an inhabited, bounded, located convex subset of X. Then 

sup {f(y) : y £ C} 

exists for each normed linear functional f on X. 

Proof. For a nonzero normed linear functional the result follows from Proposition 
6.1.4 and Theorem 6.2.1. The completion of the proof is left to the reader. □ 


Turning now towards Theorem 6.2.2, we prove the following lemma. 

Lemma 6.2.6. Let C be an inhabited, bounded, convex subset of a uniformly smooth 
normed space X over R such that 

sup{J x (» : z £C} 

exists for each x € X, and let e > 0. Then for each x £ X there exists a with 
0 < a < 1 such that for each y £ C, 

> either there exists y' £ C with ||ar — y'\\ < (1 — a) ||x — y|| 

> or else ||x — y | < \\x — z\\ + e for all z £ C. 


Proof. We may assume that e < 1. Fixing x £ X, choose a positive integer M > 
max {e/4, 1} such that ||x — z | < M for each z £ C. Since X is uniformly smooth, 
there exists S with 0 < <5 < 2 such that for all unit vectors u, v £ X, 


0 < \t\ < 6 ■■ 


| u — to|| — 


—t 


- Ju{v) 


< 


8 M ' 


( 6 . 12 ) 


Define 

e5 

a = 

8 M 

and note that 0 < a < 1. Let y £ C. If ||x — y|| < e, then ||x — y|| < ||x — z\\ + £ for 
all z £ C. Hence we may assume that e/2 < ||x — y||. Now, 

sup {J x -y{z - y) : z £ C} = sup { J x - y {z) ■ z £ C} - J x - V {y) 

exists, by our hypotheses. So either J x - y (z — y) < e ||x — y\\ for all z £ C or else 
there exists Zo £ C such that 

^\\x-y\\ < Jx-y(zo- y) ■ (6.13) 

In the first case, for all z £ C we have 
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\x~y\\ =J x - y {x-y) 


= J x -y{x - z) + J x -y{z - y) 

< \\x-y\\\\x-z\\+e\\x-y\\, 


and therefore 


In the second case, setting 


; — 2/11 < \\x-z\\+£. 


s \\x-y\\ 

2 M 


V =y + r{z 0 - y), 


we have 0<t<(5/2<1 and therefore y' € C. Define also 

1 / >, 1 / x , r\\zo-y\\ 

U =T: Ax-y), v=- [, (-0-2/), * = — n rp. 

\\x-y\\ Iko — 2 /|| If - 2/11 

Note, for the definition of v, that 

£ 

IN - 2/11 Ik - 2/11 > Jx-y{zo - 2 /) > 2 ll x - 2/11 

and hence that |ko — y\\ >0. Let 

, = r Iko ~ 2/11 = kko - 2/11 
Ik - 2/11 

Then t < 5 and, by Proposition 6.1.1 and (6.13), 

T T T£ 

— 7 ] 77 k/ (.Zq y) = - — 2 Jx—yAo y) > 


■ — 2/11 


k — 2/11 


2 Ik- 2/11 ' 


Hence 


Ik - 2 /II = lk- 2 /-^ko - 2 /)ll 
= Ik - 2 /II Ik - ^11 

< Ik - 2/11 (jMI - *k»k) + ) by (6.12) 


Since 


Iko - 2/11 < Iko - x|| + |k - 2/11 < 2M 
and (6.13) holds, this gives 


i, .. „ re re 

Ik’ - 2 / || < Ik - 2/11 - p + -r 


2 4 

= Ik - 2 /II - ^ Ik - 2 /II 
= (! — cr) Ik — 2/11 
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and completes the proof that, in this case, the first conclusion of the lemma obtains. 

□ 


We now prove Theorem 6.2.2. 

Proof. Under the hypotheses of Theorem 6.2.2, let x £ X and e > 0. It suffices 
to show that there exists y £ C such that ||x — y\\ < \\x — z\\ + e for all z € C. 
Compute cr > 0 as in Lemma 6.2.6, and let yo £ C. If ||x — j/o || < £, then ||x — j/o|| < 
||x — z\\ + e for all 2 £ C. So we may assume that ||x — j/o|| > 0- Now construct 
an increasing binary sequence (A„) n ^o with Ao = 0, and a sequence (y n )n^o of 
elements of C, such that 

A» = 0 => \\x - y n || < (1 - ct) ||x - y n -i \\ , 

A „ = 1 => ||x — y n || < \\x — z || + e for all z £ C. 

Suppose we have constructed Ao, . . . , A„ and yo, . . . , y n . If A„ = 1, set A„+i = 1 
and y n +i = yn- If A „ = 0, then by Lemma 6.2.6, either ||x — y'\\ < (1 — cr) ||x — y n || 
for some y' £ C, or else ||x — y n || < ||x — z\\ + e for all z £ C. In the first case set 
A„ + i = 0 and y n+ 1 = y'\ in the second case set A„ + i = 1 and y n+ 1 = y n . Since 
0 < a < 1, we see that (1 — a) n — > 0 as n — > oo, so there exists N > 1 such that 

Now check Ao, . . . , Xn- If there exists n ^ N such that A„ = 1, then we are finished; 
if A„ = 0 for all n ^ N, then 

\\x - 2 /jv|| < (1 - <t) n \\x - j/o|| < e < ||x - z\\ + e 

for all z £ C, and again we are finished. □ 


Corollary 6.2.7. Let C be an inhabited, bounded, convex subset of a uniformly 
smooth normed space X overR. Then C is located if and only if sup {J x {z) : z £ C} 
exists for each x £ X. 

Proof. Apply Theorems 6.2.1 and 6.2.2. □ 


Corollary 6.2.8. Let X be a uniformly convex, uniformly smooth Banach space 
over R, and C an inhabited, bounded, convex subset of X. Then C is located if and 
only if sup {f(y) : y £ C} exists for each normed element f of X* . 


Proof. Apply Proposition 6.1.4 and Corollary 6.2.8. 


□ 
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Corollary 6.2.9. Let C be an inhabited, bounded, convex subset of a Hilbert, space 
H. Then C is located if and only if 

sup {Re (x, y) : y £ C} (6. 14) 


exists for each x £ H. 

Proof. Consider first the case K = R. Since H is uniformly convex (see Section 3 
of Chapter 4) and, by Proposition 6.1.7, uniformly smooth, the desired conclusion 
follows from the Riesz representation theorem and Corollary 6.2.8. We now easily 
obtain the result in the case K = C, since C is located in H if and only if it is 
located in the underlying real Hilbert space. □ 


When we work with a Hilbert space rather than a general normed space, we can 
remove the boundedness hypothesis from Theorem 6.2.2. For this we need some 
preliminaries, including an important principle of functional analysis. 

Lemma 6.2.10. Let C be a convex subset of an inner product space X such that 
sup {Re (x, z) : z £ C} exists for each x £ X. Let x £ X and y £ C. Let n be a 
positive integer and let 0 < e < 1. Then at least one of the following alternatives 
holds: 

(a) There exists yo £ C such that ||ar — yo\\ 2 < ||x — y\\ 2 — s 4 /16n. 

(b) There exists zo £ C such that n/4 < ||x — y \\ 2 + ||x — zo\\ 2 ■ 

(c) \\x — y\\ < ||x — z\\ + e for all z £ C. 

Proof. If \\x — y\\ < s, then (c) holds trivially; so we may assume that ||x — y\\ > 
e/2. Our hypotheses ensure that 

M = sup {Re (x — y, z — y) : z £ C} 

exists. Either M < e 2 /2 or else e 2 /4 < M. In the first case, for all z £ C we have 

II® - y || 2 = Re (x-y,x-y) 

= Re (x — y, x — z) + Re (x — y,z — y) 

< II® -2/11 II®- 4 + £ — 


and therefore 

II® - y II < II ® - 4 + 2 \\ x -y\\ < 11* _ + e - 

Hence (c) holds. We may therefore assume that e 2 /4 < M. Choosing Zq £ C such 
that 
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T = Re (* - y, z 0 - y) > — , 


we consider three exhaustive cases. 


Case 1: \\z$ — y\\ < 3r/2. We have 


\x-z 0 \\ 2 = \\x-y- Oo - 2 /)|| 2 

= \\x - yf ~ 2 t + \\z 0 - y\\ 2 

< \\x-yf ~ h 

< Ik- yf -J 


since 0 < e < 1. Hence (a) holds with y 0 = zq. 


Case 2: n/2 < \\z 0 — y\\ . Using the parallelogram identity, we have 

| < ||0o - x) + (x -y ) || 2 

= 2 ||a; - z 0 f + 2 ||* - y\\ 2 - \\z 0 + y- 2*|| 2 


< 2 ||* — 2 0 || 2 + 2 ||* — 2/|| 2 , 


so (b) holds. 


Case 3: r < ||z 0 — y || 2 and ||z 0 — y|| 2 < n. Then 


yo = y+ ,72 ("o - y) 

Iko — 2/|| 


belongs to C, and 


2= f _2rR^ L -y 1 z^-y l 

11*0 -rf 2 


ko — 2/ir 


ko — 2/ir 


< *-: 


1 2 £ 


and so (a) holds. This completes the proof. 


The classical uniform boundedness theorem says: 
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If ( Ti) ieI is a family of bounded linear mappings from a Banach space X 
into a normed space Y such that for each x £ X the family (||Tjx||) igJ is 
bounded, then there exists M > 0 such that ||Tj|| ^ M for each i £ I. 

The reader is invited to consider why this form of the theorem is unlikely to be 
provable constructively. The correct constructive approach is via the contrapositive; 
here is a pretty version due to Royden [78]. 

Theorem 6.2.11. (Royden’s uniform boundedness theorem) Let (T n ) n>1 be 
a sequence of normed linear mappings from a Banach space X into a normed space 
Y such that ||T ra || > n3" for each n. Then there exists x £ X such that ||T n a:|| > n 
for each n. 

Proof. For each n > 1 choose x n £ X such that ||a;„|| = 3~ n and ||T n a; n || > 
fUTJIM.Then 

OO 

x = 4 e x n 

n= 1 

OO 

is well defined, since (by comparison with 3 - ™) the series on the right converges 

n = 1 

in the complete space X. For n ^ 2, letting 

y n = x i H h X n -i, 


we have 

^ ||T„|| ||a; n || < 2 ||T n a: n || = ||T (x n + y n ) + T (. x n - y„)\\ 

< ||T„ (x n + y n ) || + ||T (y n - x n )\\ , 

so at least one of the last two terms is greater than | ||T„|| ||a; n ||. Replacing x n by 
— x n , if necessary, we may assume that 

ll^n (x n + y n ) II > ^ 1 1 Yn 1 1 ||&n|| • 

Hence 


4 IMI = 




T n ( E x k ) + T n ( E Xk 

\k=n -\- 1 / 


\k = 1 


n— 1 


Tn %n H” ^ ^ %k 


k=l 


-\\Tn\\ E H*fcl 

k=n -\- 1 


i l|T n ||3-"-||T n || E 3 

k=n -\- 1 

3 1 


— k 


> 


> \\T n \\3~ n ( - - - ) = - IITkll 3 
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and therefore ||T n x|| >3 n \\T n \\ > n. 


□ 


Theorem 6.2.12. (Uniform boundedness theorem) Let (T n ) >x be a se- 
quence of normed linear mappings from a Banach space X into a normed space Y 
such that ||T n || — * oo as n — > oo. Then there exists x £ X such that the sequence 
(||T n x||) n>1 is unbounded. 

Proof. Compute a strictly increasing sequence (n k ) k>1 of positive integers such that 
||T„J| > k3 k for each k, and then apply Theorem 6.2.11. □ 


It is possible to remove from these two uniform boundedness theorems the con- 
dition that each T n be normed; see Exercise 12. 

Theorem 6.2.13. Let C be an inhabited convex subset of a Hilbert space H such 
that 

sup {Re (x, y) : y £ C} 
exists for each x £ H. Then C is located. 

Proof. Fix £ £ H and y 0 £ C. In order to prove that p((. C ) exists, it is enough to 
show that if 0 < £ < 1, then there exists y £ C such that ||£ — jj|| < ||£ — j/|| + £ for 
all y £ C. If ||£ — 2/0 1 1 < e > then we may take y = yg. Hence we may assume that 
£ yo- Setting Ao = 0, for the purposes of this proof we extend the A-technique by 
constructing a sequence (A„) n>0 with values in { — 1,0,1}, and a sequence (y n ) n > o 
in C, such that for each n ^ 1, 

IIC Un || < M-V n -l\\ 


and the following hold: 

A„ = -1 => lie - Un 1 1 2 < lie - Un- 1 || 2 - £ 4 /16?Z, 

A„ = 0 => n/4 < \\( - y n \\ 2 + ||£ - 2 r|| 2 for some z £ C, 

a„ = i => ne - Un i < lie - a\ + e for a11 z£c. 

Assume that we have already constructed Aq , . . . , A n and yo,...,y n . If \ n = 1, 
set A n+ i = 1 and y n +\ = y n . If A„ 1, then we apply Lemma 6.2.10. If there 
exists y £ C such that ||e — y \\ 2 < 1 1 e — 2/n 1 1 2 — £ 4 /16n, we set A„+i = —1 and 
■y n + 1 = y, if n/4 < lie - Vn\\ 2 + lie ~ z\\ 2 for some z £ C, we set A n+ i = 0 and 
Vn + 1 = Vn ; if lie - Vn\\ < ||e - 2:|| + £ for all z£C, we set A n+ i = 1 and y n+1 = y n . 
This completes our inductive construction of the sequences (A„) n>0 and (y n ) n > o • 
It remains to compute n such that A n = 1. 

Let (Nk) k>i be a strictly increasing sequence of positive integers such that for 
each k, 
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N k + 1 4 

N k > 4:(k+ ||£-yo|| 2 ) and ||£-y 0 || 2 < 5Z 

n=N k + 1 
oo 

(recall that 1/n diverges to infinity). Observe that if \ n = —1 whenever iVfc < 

n = 1 

n ^ fVfc + i, then 

JVfc+i 4 Affc+i 4 

°< ||^-2/Jv fc+1 || 2 < ||^-t/jvj| 2 - ^ < ll£“ 2/o|| 2 - 51 ^ <0 ’ 

n=iVfc + l n=Wfc + l 

a contradiction. Thus for each k there exists nk such that Nk < nk ^ Nk+i and 
either A„ fc = 0 or \ nk = 1. In the first case we choose z' € C such that 

^< ll £- 2 Mj 2 + ||£-*'|| 2 


and set Wk = £ — z' . In the second case set 


w k 


n k 

ll€ — 2/o|| 


(£ - yo ) ■ 


In each case we have 


IKII 2 yoll 2 > ^ - U - yoll 2 > k, 

so ||tUfc|| — » oo as k — > oo. We now apply Corollary 6.2.12 to the normed lin- 
ear functionals x ( x,Wk ) on H , to produce £ 0 € H such that the sequence 

{|(?o> w k)\} k^i is unbounded. 

By our hypotheses, there exists M > 0 such that |(£ 0 ,£ — z)\ < M for all z € C. 
Choosing K such that M < |(£ 0 i w if)| , suppose that \ nK = 0. Then, by our con- 
struction of the vectors Wk , there exists z £ C such that wk = £ — z and therefore 
|(Coi' u; ^)l = |(£o>£ — z )\ < M, a contradiction. Hence X nK = 1. □ 


We shall use some of these results on locatedness in the next section, when we 
look at adjoints of operators. 


6.3 Adjoints 


Throughout the rest of the chapter, H will be a Hilbert space. By an operator on 
H we mean a linear mapping of H into itself. 

We observed in Section 3 of Chapter 5 that the constructive Riesz representation 
theorem, with its requirement that the linear functional be not just bounded but 
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normed, does not enable us to prove that every bounded operator on H has an 
adjoint. In fact, we showed that the proposition “every bounded operator on a 
Hilbert space has an adjoint” implies LPO. 

Are there general criteria for the existence of the adjoint? To answer this ques- 
tion affirmatively, we first prove a lemma. 

Lemma 6.3.1. Let T be an operator on H , and B a subset of H. Let a be a unit 
vector in H , and P the projection of H on the 1-dimensional subspace Ka. Then 
PT{B) is located if and only if the set {(Tx,a) : x £ B} is located in K. 

Proof. For all x,y £ H, since (I — P)y is orthogonal to P(H), we have 

II V ~ PTxf = || Py- PTxf + ||(I - P) yf 

= || (Py,a)a- {PTx,a)af + \\{ 1 - P)yf 
= \(y,a)-(Tx,a)\ 2 + \\(l-P)y\\ 2 . 

Hence 

inf \\y-PTx\\ 2 

x£B 

exists if and only if 

inf \(y, a) - (Tx,a) | 2 

x£B 

exists, from which the desired conclusion follows. □ 


Proposition 6.3.2. Let T be a jointed operator on H. Let B be the unit ball of H, 
and P the projection of H on a 1-dimensional subspace. Then PT{B) is located. 


Proof. Choose a unit vector a £ H such that Px = (x, a) a for all x £ H. By the 
preceding lemma, it suffices to prove that the set 


S = {( Tx,a ) : x £ B} = {( x,T*a ) : x £ B} 


is located in K. We do this by showing that S is dense in the (located) ball 
H k (0, ||T*a||). Given e > 0, we have either ||T*a|| < e, in which case |£ — 0| < e for 
all C, £ IK with |£| ^ ||T*a||, or else T*a ^ 0. In the latter case, for each (gK with 
C ^ ||T*a|| we have 


c = 


HT*a|| 


T T*a,T*a 


and 


c 

||T*af 


T*a £ B. 


Thus in either case there exists x £ B such that |£ — (x,T*a}\ < e. Since e > 0 is 
arbitrary, it follows that S is dense in (0, ||T*a||) . □ 
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Proposition 6.3.3. Let H be a Hilbert, space with unit, ball B , and T a bounded 
operator on H such that PT{B) is located for each 1-dimensional projection P. 
Then T has an adjoint. 

Proof. By Lemma 6.3.1, for each unit vector y £ H, the set 

C = {( Tx,y ) : x £ Bj 

is located in K. Since T is bounded, C is a subset of a ball in K. It follows from 
Corollary 4.1.7 and Proposition 2.2.10 that C is totally bounded. Thus, by Propo- 
sitions 2.2.6 and 2.2.5, the linear functional u defined on H by 

u(x) = ( Tx,y ) (x £ H) 

is normed. By the Riesz representation theorem, there exists a unique element T*y 
of H such that (Tx, y) = ( x , T*y ) for all x £ H. □ 

Theorem 6.3.4. Let T be a jointed operator on a Hilbert space H. Then the image 
under T of the unit ball is located. 

Proof. By Proposition 6.3.2 and Lemma 6.3.1, for each x £ H the set 

{(Ty,x) :y £ B} 

is located in K. This set is also bounded, since 

\(Ty,x)\ = \(y,T*x)\ ^ ||T*ar|| 

for all y £ B. Hence, as in the previous proof, it is totally bounded. Since the map 
f Re £ is uniformly continuous on K, we now see that 

sup {Re (. x,Ty ) : y £ B} = sup {Re ( Ty,x ) : y £ B} 

exists for each x £ H. It follows from Theorem 6.2.13 applied to C = T(B) that 
T(B) is located in H. □ 


Theorem 6.3.5. Let T be a bounded operator on a Hilbert, space H that, maps the 
unit, ball to a located set.. Then T has an adjoint. 

Proof. Given y £ H, we take C = T(B) in Corollary 6.2.9 to show that the linear 
functional x (Ty,x) is normed. The result now follows from the Riesz represen- 
tation theorem, as in the proof of Proposition 6.3.3. □ 


Thus for a bounded operator on a Hilbert space, the existence of the adjoint is 
equivalent to the image of the unit ball being located. 

When is a jointed operator bounded? The classical answer is “always”; the 
constructive answer is less decisive. 
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Theorem 6.3.6. ( Helling er-Toeplitz theorem) Every jointed operator on a 
Hilbert space is sequentially continuous. 


Proof. Let T be a jointed operator on a Hilbert space H. It will suffice to prove that 
T* is sequentially continuous, since we can then interchange T and T* to obtain the 
desired result. Accordingly, let (% n ) n>1 be a sequence converging to 0 in H, and let 
e > 0. By Ishihara’s second trick (Lemma 3.2.2), either ||T*a; n || < e eventually or 
else there exists a strictly increasing sequence (nk) k ^ 1 of positive integers such that 
||T*x nfe || > e/2 for all k. In the latter case, passing to a subsequence if necessary, 
we may assume that 


\\ x nic II 


£ 

2k' 2 3 k 


for each k. Then, setting 


2k3 k 

Uk — 7 X rik > 


we have ||yfc|| < 1/k and ||T*j/ fc || > k3 k . We now apply Royden’s uniform bounded- 
ness theorem (Theorem 6.2.11) to the normed linear functionals x (■ x,T*y k ) on 
H , to construct a vector x € H such that \(x,T*yk)\ > k for each k. Since 


(■ x,T*y k ) = ( Tx,y k ) < ||Tx|| \\y k \\ — ► 0 as k — > oo, 
this is absurd. We conclude that ||T*x„|| < £ for all sufficiently large n. □ 


A subset S of N is pseudobounded if lim^^co n~ 1 s n = 0 for each sequence 
(s„)„^i in S. The following principle is trivially true in classical mathematics, 
holds in both INT and RUSS, and appears not to be provable in BISH (see [59]). 


BD-N Every inhabited , countable, pseudobounded set of positive integers 
is bounded. 

We now prove that if “sequentially continuous” can be replaced by “bounded” in 
the conclusion of Theorem 6.3.6, then BD-N holds. 

Let 

A = {ai,a 2 , . • •} 

be a countable pseudobounded set of positive integers. Let H be an infinite- 
dimensional Hilbert space with an orthonormal basis (e„) n>1 of unit vectors. We 
first prove that for each x £ H, 

OO 

Tx = ^2a n (x, e n ) e n 

n—1 

is well defined. Let (N k ) k ^ l be a strictly increasing sequence of positive integers 
such that 
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E l(z,e n )| 2 < 2 k 1 k 2 , 

n=N k 

and construct a binary sequence (Afc)fc^i such that 

Nk+i — 1 

E fl nK X,e n )f <2~ k , 

n=N k 
N k + i—l 

E a 2 \(x,e n )\ 2 >2- k - 1 . 

n=N k 

Define a sequence (b k ) k ^ l in 4 as follows. If A& = 0, set bk = oj. If A k = 1, then 

Nk+i — 1 

2 _fc ~ 1 fc _2 max{a 2 : N k < n < N k+1 } ^ E a n e «)| 2 > 2 _fe_1 , 

n=N k 

so a n > k for some n with N k ^ n < N k + 1 ; in this case we set b k = a n for this 
n. Since A is pseudobounded, there exists m such that b k /k < 1 for all k ^ in. 
If A k = 1 for some k ^ m, then 1 < < 1, a contradiction; hence A& = 0 for 

all k ^ m, and therefore the series defining Ta; converges. It is easily seen that T 
is a one-one selfadjoint linear mapping of H onto itself; so, by Theorem 6.3.6, it 
is sequentially continuous. But if T is bounded, then the pseudobounded set A is 
bounded (by any positive bound for T). 


Afc = 0 

Afe = 1 


6.4 Functions of Selfadjoint Operators 

We say that a sequence (S n ) n>1 of operators on our Hilbert space H converges 
strongly to an operator S if 

Sx = lim S n x 

n— >oo 

for each x £ H. Given an operator T on H , for each power series 

OO 

p(t) = E Cnin ( 6 - 15 ) 

n — 0 

with complex coefficients c n we can form the corresponding power series in T , 

OO 

p(T) = E c " T ”’ 

n — 0 

which makes sense provided the series on the right converges strongly. In particular, 
if p{t) is a polynomial, regarded as a power series whose coefficients are eventually 
0, then p(T) is always defined. 
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If each coefficient c n in (6.15) is real and T is selfadjoint, then p{T) is also 
selfadjoint. Our first aim is to prove that if, in that case, T has bound 1, then p(T) 
is bounded by the sup norm of p on the interval [0, 1]. This will require us to prove 
some lemmas about approximate eigenvalues. 

Note that two operators S,T on H are said to commute if ST = TS. Also, 
T° = I and T" = TT 71 " 1 for any n ^ 1. 

Lemma 6.4.1. Let T be a selfadjoint operator on H with bound 1, and let x be a 
unit vector in H. Let 



1 1 

< 1 \ 

v = Tx — {Tx, x) x, 

U = X + -V 

[ X+ 2 V ) 


Then 

(Tu, u) > (Tx, x) + i |M| 2 . 

Moreover, if S is any operator that commutes with T, then ||jSu|| ^ 2 ||Sar|| . 


Proof. Clearly, v £ and Tx — v _L {a;} 2- , so (by Theorem 4.3.1) v is the 

projection oi Tx onto {x} . Hence 

1 2 i 

X+-V = 1+ - IMI 2 > 0, 

and our definition of u makes sense. Also, 

{Tx, v) = {Tx, v) - {{Tx, x) x, v) = ||u|| 2 
and therefore, since T = T* , 
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since ||u| < 2. 

Finally, let S commute with T. Then 


\\Su\\ < 


S [x+^v 


Sx — — (Tx, x) Sx + -T/Sx 


< ( 1 + \ \( Tx > x )\ + \ ) IM| < 2\\Sx\\ . 


□ 


We show how to construct approximate eigenvectors common to finitely many 
commuting selfadjoint operators. We deal first with the case of a single selfadjoint 
operator. 


Lemma 6.4.2. Let T be a selfadjoint operator on H with bound 1, and let x\ be a 
unit vector in H. Let e be a positive number, and N an integer greater than 32/e 2 . 
Define sequences (x„) n>1 , (j/n) ra>1 in H recursively by 


Vn — Tx n (Tx n , X n ) X n , X n +\ 





Then (Tx n + i,x„+i) ^ ( Tx n ,x n ) for each n, and there exists n ^ N such that 

WVn || < £• 


Proof. By Lemma 6.4.1, 


(Tx n +] , X'n-\-\) (Tx n ,Xn) ^ ^ ||t/n|| 

On the other hand, either ||y n || < e for some n ^ N, or else ||j/ n || > e/2 for all 
n ^ N. In the latter case, Lemma 6.4.1 shows that 

e 2 

(Tx n + ( Tx n , x n ) ^ 


for each n ^ N, and therefore that 


(Txjv+i,xjv+i) ^ (Tx i,x{) + 


Ne 2 

HT 


>-1 + 2 = 1 , 


which is absurd since xjv+i is a unit vector and T has bound 1. 


□ 
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Lemma 6.4.3. Let Tj,, . . ,T n be commuting selfadjoint operators on H with com- 
mon bound 1, and let x be a unit vector in H. Then for each e > 0 there exists a 
unit vector u such that ( Tjw, w ) ^ (Tix,x) — e and 

|| T k u - ( T k u , it) it|| < e (1 < k < n) . 

Proof. Noting that the case n— 1 has been disposed of in Lemma 6.4.2, let n > 1 
and suppose that we have proved the desired result for n — 1 commuting selfadjoint 
operators. Fix e > 0, and choose a positive integer N > 32 /e 2 . By our induction 
hypothesis, there exists a unit vector X\ such that 

(TxXi,Xi} ^ (Tjx,x) - | 

and 

\\T k xi - (T k x i, aii) a;i|| < ^ (1 < k < n- 1) . 

Taking T = T n , define sequences {x k ) k>1 and (y k ) k>1 as in Lemma 6.4.2. By that 
lemma, there exists j (1 ^ j ^ N) such that || T n Xj — ( T n Xj,Xj ) ary || < e. Setting 
u = Xj, we see that if j = 1, then we are finished; so we may assume that j > 1. 
Then for 1 ^ fc ^ n — 1, since the selfadjoint operator T*, — (T k xi,X\) I commutes 
with T„, we see from the final part of Lemma 6.4.1 that 

|| T k u- (T k xi,xi)u\\ < 2 \\T k Xj-\ - (T k xi,xi) Xj^i\\ 

< ... 

2 J ~ 1 \\T k xi - (T k xi,xi) xx\\ 

< 2 W - 1 ^ = |. (6.16) 

On the other hand, noting that for all z £ H and all a, b € R, 

|| z — au\\ 2 — || z — bu\\ 2 = (a — b) (a + b — 2 Re (z, u )) , 


we have 

\\T k u- (T k u,u)u\\ 2 - \\T k u — (T k xi,xi) u\\ 2 

= ((TfcW, u ) - (TfcXi, xi)) ((Tfeir, u) + (TfcXi, xi) - 2 (T fc u, u)) 
= - (( T k u,u ) - (T k xi,x\}) 2 
< 0 . 

It follows from this, (6.16), and our choice of u that ||Tfcit — (T k u,u)u\\ < £ for 
1 ^ k < n. Finally, again using (6.16), we see that 

{Tiu,u) 3s (Tixi,xi) - |((Ti - (Tixi.xi)) u,u) \ 

> (Tixi,xi) - ||Tjw — (Tixi,xi) «|| 

> (Tix,x) - | | 

= (Tjx, x) — e. 
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Our induction is now complete. 


□ 


Proposition 6.4.4. Let T be a self adjoint operator on H with bound 1. Then for 
each polynomial p with real coefficients, the operator p(T) has bound 

M = sup{|p(t)| : 0 ^ t < 1} . 


Proof. Write 


N 


pit) = ]Tc n r, 

n = 0 


where each c n £ R. Let a; be a unit vector in H, and let e > 0. Since p(T) has only 
real coefficients, it is selfadjoint. By Lemma 6.4.3, there exists a unit vector u £ H 
such that (p(T)u, u) ^ (p(T)x, x) — £ and 


|| Tu — ( Tu , u) w| < 


1 + 


N 

n= 1 


-1 


e. 


Taking t — ( Tu,u ), for n > 1 we compute 

||T n it - t n u\\ s? || T (T n ~ 1 u - t n ~ l u ) || + Itr 1 ||Tu - tu\\ 

< ||T n - 1 «-t n - 1 «|| + ||Tu- to|| 

< ■■■ 

^ n || Tu — fu|| . 


Hence 


N 


I \p(T)u - p(t)u\\ < ^2 |c„| \\T n u - t"u|| 

n— 1 

< (X>|c„|j \\Tu- (Tu,u)u\\ < e 


and therefore 


(p(T)x,x) < (p(T)u,u)+e 


< \p(t ) I + I (( p(T)u - p{t)u) ,u)\+£ 

< M + ||p(T)u — p{t)u\\ < M + 2e. 


Since e > 0 is arbitrary, it follows that (p(T)x, x) ^ M. Replacing p by p 2 , we now 
have 


\\p(T)x\\ 2 = (p{T) 2 x,x) = (p 2 (T)x,x) 

< sup j|p(f)| 2 : 0 < t < lj = M 2 


and therefore ||p(T)x|| < M. 


□ 
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An operator T on H is said to be positive if (Tx,x) ^ 0 for all x £ H. In that 
case, T is selfadjoint, since for all x,y £ H we have both (Tx, x) = (x,Tx) and the 
polarisation identity 

4 ( Tx , y) = (T (x + y ) , x + y) - (T (x - y ) , x - y) 

+ i (T (x + iy) , x + iy) - i (T (x-iy),x- i y) 

(which is actually valid for any operator T on H). We introduce a partial order on 
the set of selfadjoint operators by writing S' ^ T, or equivalently T ^ S, if and only 
if T — S is a positive operator. 

For our first result about positive operators we use a weak constructive substi- 
tute for the classical result that a series of positive real numbers is convergent if its 
partial sums form a bounded sequence. 

OO 

Lemma 6.4.5. Let ^ a n be a series of nonnegative terms whose partial sums form 

n— 1 

a bounded sequence. Then for each £ > 0 and each positive integer n there exists 
k ^ n such that a*, < e. 

Proof. Let b > 0 be an upper bound for the partial sums of the series. Given e > 0, 
choose a positive integer N such that Ne/2 > b. If a*, > s/2 for n < k < n + N , 
then 

ra+JV n+N , r 

- Ne 

ak> 2^ a fc > ~2~ > b, 

k—1 k=n + 1 

a contradiction. Hence ak < £ for some k with n < k n+ N. □ 


Proposition 6.4.6. Let S and T be commuting positive operators on H 1 with S 
bounded. Then ST is positive. 

Proof. The proof is similar to the classical one on pages 415-417 of [4], so we give 
only an outline, leaving the details as an exercise. Since S is bounded, we may 
assume that 0 ^ S < I. Define a sequence (5'n)„>i of bounded selfadjoint operators 
on H such that 

S\ = S, S n+1 = S n ~S 2 n (n ^ 1) . 

By induction, 0 ^ S n ^ I for each n. We now have 

n n 

0 < E s k = E s *+i) = 5 ^ 

k=l k = 1 

It follows that for each x E iJ, 

n n 

= ^2( s k x : x ) = (( s - S n+ i)x,x) < (Sx, x) . 

k = 1 k = 1 
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By Lemma 6.4.5, there exists a strictly increasing sequence (ni) i>1 of positive in- 
tegers such that || S Ui x || < 2 _i for each i; whence 


S k x = S x — S ni + \x — > Sx as i — * oo. 

fe= l 

Since T commutes with S, it commutes with every S n . Hence 

( STx,x ) = ( TSx,x ) = / T ( lim S%x\ ,a:\ . 


fc= l 


Now, T is positive and hence selfadjoint; so, by Theorem 6.3.6, it is sequentially 
continuous. Hence 


(. STx,x ) = lim (TS k x,x) 

i — >oo * ^ 
k = 1 

Tli 

= lim V (S k TS k x,x) 

i —> oo z ^ 
fc=l 

Tli 

= lim y (TS k x, S k x) ^ 0. 


ferrl 


Since a; is arbitrary, we conclude that ST ^ 0. 


□ 


Our next objective is to construct the positive square root of a bounded positive 
operator. We first examine an iteration scheme for the function t y/t on [0, 1] . 
A standard classical proof of the convergence of that scheme uses Dini’s highly 
nonconstructive theorem on the uniform convergence of monotone sequences of 
continuous functions [47] (pages 131-132). Fortunately, with relatively little extra 
effort, we can avoid using Dini’s theorem altogether. 

Lemma 6.4.7. Define a sequence ( u n ) n>1 of uniformly continuous mappings from 
[0, 1] into R. iteratively by 

Ul(t) = 0, U n+1 (t) = U n (t) + ^ (t - U nW) ■ 

Then (u n (t)) n>1 converges to \/t uniformly on [0,1]. 

Proof. For each t £ [0,1] and each n, 

Vt - u n+ i{t) = y/t- Unit ) - ^ (t - Unit)) 

= (Vt ~ Unft )) (>/< + Unit)') 


(6.17) 
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Using this, it is simple to prove by induction that u n + 1 ^ u n and u n {t) ^ \Jt. Let 
0 < £ < 1, and compute a positive integer N such that 


1 - 


2V2 


< e (n ^ N) . 


Consider any t £ [0, 1] . If t < £ 2 , then for all n we have 0 ^ u n (t) < \ft < e and 
therefore 

0 ^ \/t — u n (t) < e. 


If t > e 2 /2, then for all n ^ 1, 

1 
2 


t + u r . 


and therefore 


1- - [Vt + llr 


1 r £ 
> 2' /t> 2Vi 


< 1 - 


2 V / 2‘ 


It follows from this and (6.17) that for all n ^ 2N, 




< ( 1 

( 1 


H 1 

< £. 

Thus (6.18) holds for all n ^ 2N. 


2\[2 


2V2 


2\[2 


t Un— 1 


t - U n 


i-N 


N 


t — un 


(6.18) 


□ 


Our next lemma will enable us to transform the iteration in Lemma 6.4.7 into 
one for the square root of a bounded positive operator. 

Lemma 6.4.8. Let T be a selfadjoint operator on H that satisfies 0 ^ T ^ I, define 
the sequence (u n ) n>1 as in Lemma 6.4.7, and write U n = u n (T). Then for each n 
we have 0 ^ U n ^ \ (I + T ) , U 2 ^ T, and 

T - U 2 n+1 = (T - U 2 n ) (i - \ ( U n+1 + U n )j ■ 


(6.19) 
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Proof. We have 

T-Ul +1 =T-U 2 n + (Ul-Ul +1 ) 

= T~U 2 ^ ( U n+1 - U n ) (U n+1 + U n ) 

= T-Ul- l -(T-Ul) (U n+1 + U n ) 

= (T-C/2) (i _ I ([/ n+1 + [/„)) . 

Suppose that for some n we have 0 < U n < \ (I + T) and U% < T. (These inequal- 
ities certainly hold for n = 1.) Then 0 ^ U n ^ I, so U n and l — U n are positive and 
hence selfadjoint, and (I — U n ) 2 ^ 0. Thus 

U n +i = - (I + T) - -I + ^ U n - — U^j 

= \(l + T)- 1 -(I~U n ) 2 <±(1 + T). 

On the other hand, 

U n+1 -U n = \(T-Ul) ^0 
and therefore U n+ \ ^ U n ^ 0. Thus 

0 ^ — {U n + 1 + U n ) ^ U n+1 ^ - (I + T) ^ I, 

and so 

I — 2 (U n +i + U n ) ^ 0. 

Hence T — U% +1 ^ 0, by (6.19) and Proposition 6.4.6. This completes the induction. 

□ 


Proposition 6.4.9. Let T be a bounded positive operator on H. Then there exists 
a unique positive operator U on H such that U 2 = T. Moreover, U is bounded, U 
commutes with every operator that commutes with T, and the range of U is dense 
in the closure of the range of T. 

Proof. We may assume that 0 ^ T < I. Define ( u n ) n>l and ( U n ) n>1 as in Lemmas 
6.4.7 and 6.4.8. By Lemma 6.4.8, 0 ^ U n ^ I. Since U n is a polynomial in T, 
it commutes with every operator that commutes with T. By Lemma 6.4.7, for 
each £ > 0 there exists N e such that \u m (t) — u n (t)\ < e for all t € [0, 1] and all 
m, n ^ N e . It follows from Proposition 6.4.4 that \\U m x — U n x\\ < £ ||a;|| for all 
x £ H and all m, n ^ N e . Hence ( U n ) n>1 converges strongly to an operator U on 
H. Clearly, 0 ^ U ^ I and U commutes with every operator that commutes with T. 
Moreover, for each n ^ 1, since u n {t) is a strict polynomial (one without constant 
term) over R, we have ran (U n ) C ran(T), from which it follows that 
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ran (U) C ran (T). (6.20) 

With £ and N e as before, we see that 

\ u n(t)~t\ = Un(t) + Vt U„(t) - Vt 

< 2 Un{t) - Vt 

= 2 lim \u n (t) - u m (t)\ 

m—* oo 

< 2e 

for all t £ [0, 1] and all n ^ N e . It follows from Proposition 6.4.4 that 

\\(UZ-T)x\\ <2e ||*|| 
for all x £ H and all n ^ N e . Hence 

||(^ 2 -r)*|| < \\( u 2 - u 2 ) x \\ + \\( u 2 - t ) x \\ 

= \\(U + U n )(U-U n )xW + \\(UZ-T)x\\ 

< 2\\{U -U n )xW + \\(Ul-T)x\\ 

— > 0 as n — > oo 
and therefore U 2 = T. Moreover, 

Tx = U ( lim U n x\ , 

\n— ► oo / 

which, taken with (6.20), shows that ran(U) is dense in the closure of ran(T) . 
Now suppose that we have another positive operator S such that S 2 = T. Then 

ST = SS 2 = S 2 S = TS, 

so S commutes with T, and therefore U commutes with S. Given x £ H, write 

y = Ux — Sx. 

Then 

(Uy, y ) + ( Sy , y) = ((U + S) (U - S ) x,y) = ( ( U 2 - S 2 ) x, y) = 0. 

But ( Uy,y ) ^ 0 and ( Sy,y ) ^ 0, so we must have ( Uy,y ) = 0 = ( Sy,y ) . Now 
apply the first part of the proof to U, to obtain a positive operator A such that 
A 2 = U. Then 

\\Ay\\ 2 = (Ay, Ay) = ( A 2 y,y ) = ( Uy,y ) = 0, 
so Ay = 0. Hence Uy = A 2 y = 0, and similarly, Sy = 0. It now follows that 
|| Ux - Sxf = ((U -S)(U- S) x, x ) = {(U - S) y, x ) = 0. 

Hence S = U. □ 

The operator U in Proposition 6.4.9 is called the square root of T and is denoted 
by VT or T 1 / 2 . 



180 6 Operators and Locatedness 


6.5 Locating the Kernel and the Range 

Let T be a jointed operator on the Hilbert space H. In this section we deal with 
questions like the following: 

When is ran (T) located? Is the locatedness of ran (T) linked to the locat- 
edness of ker(T) or that of ran (T*)? When is ran (T) closed? 

As we shall see, the answers depend on some interesting applications of the A- 
technique. We begin with two elementary lemmas. 

Lemma 6.5.1. Let X and Y he orthogonal subspaces of a Hilbert, space H such 
that. X + Y is dense in H. Then both X and Y are located. 

Proof. Given z £ H, choose a sequence (x n ) n>1 in X and a sequence (y n )„>i i* 1 L 
such that x n + y n — > z as n — > oo . For m ) n we have 

\\x m - x n \\ 2 + \\y m - y n \\ 2 = \\{x m - x n ) + (y m ~ y n )\\ 2 

= \\{x m + y m ) — (x n + Vn )\\ 2 
— » 0 as n — > oo. 

Hence (x n ) n>1 , (y n ) n >i are Cauchy sequences in X , Y respectively, and so converge 
to respective limits Xoo, y <*, in H such that 2 = Xao + poo, Xoo A. j/oo and _L X. 
Given x £ X, we have 

\\ z - £|| 2 = \\y°° + (Xoo - £r ) || 2 = Hz/oo || 2 + II Xoo - x\\ 2 ^ ||yoo || 2 , 

with equality when x = x^. It follows that p(z,X) exists and equals 1 1 2/oo 1 1 - Hence 
X, and likewise Y, is located. □ 

Lemma 6.5.2. Let T be a jointed operator on H. Then ran(T) ± = ker(T*). Also, 
ran (T) is located if and only if ran (T) + ker (T*) is dense in H , in which case 
ker (T*) is located. 

Proof. First observe that 

y ± ran (T) <=> \/x £ H ((x, T*y ) = 0) T*y = 0, 

so ran(T)- 1 - = ker(T*). It follows that if ran(T) is located, then so is ker(T*); 
moreover, if P is the projection of H on the closure of ran(T), then since 
x = Px + (I — P) x for each £ H , we see that ran(T) + ker (T*) is dense in H. 
If, conversely, ran(T) + ker (T*) is dense in H , then we can apply the preceding 
lemma to show that ran(T) and ker (T*) are both located. □ 
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Lemma 6.5.3. IfT is a jointed operator on H , then ran (TT*) is dense in ran (T) . 

Proof. Let B denote the closed unit ball of H. By Theorem 6.3.4, TT*(B) is located, 
so TT*(nB) is located for each positive integer n. Given x G H and e > 0, we need 
only show that p ( Tx , TT* ( B n )) < e for some n. To this end, construct an increasing 
binary sequence (A„) n>1 such that 

A„ = 0 =>p(Tx,TT* (■ nB )) > 

A n = 1 => p (Tx, TT* (nB)) < e. 

We may assume that Ai = 0. If A„ = 0, then, applying the separation and the Riesz 
representation theorems, we construct a unit vector y n £ H such that 

(TT*z, y n ) + | < (Tx, y n ) (z <E nB) . 

Then 

n\\T*y n \\ 2 < (TT*(ny n ),y n ) + | < ( Tx,y n ) < ||Ta;|| 

and so ||T*y„|| 2 < ?i _1 ||Tx|| . If A„ = 1, we set y n = 0. Clearly, the sequence 
(T*y n ) n>1 converges to 0. Choose a positive integer N such that ( x,T*y N ) < e/2. 
If A at = 0, then 

^ ^ + (TT* (Ny N ),y N ) < ( Tx,y N ) = (x,T*y N ) < 

a contradiction. Hence A^v = 1. □ 


For a first application of Lemmas 6.5.2 and 6.5.3, we call an operator T on H 
sequentially open if for each sequence (x n ) n ^ 1 such that (Tx n ) n>1 converges to 0, 
there exists a sequence (y n ) n > i hr ker(T) such that x n + y n — > 0. 

Proposition 6.5.4. Let T be a sequentially open operator on H with an adjoint. 
Then ran (T) and ker (T*) are located. 

Proof. By Lemma 6.5.3, there exists a sequence (x n ) n>1 in H such that 

T (T*x n — x) — * 0 as n — * oo. 

Since T is sequentially open, there exists a sequence (y n ) n > i in ker (T) such that 
T*x n + y n — > x as n — > oo. We now see that ran (T*) + ker (T) is dense in H\ 
whence, by Lemma 6.5.2, ran (T) is located. □ 


If T is any bounded operator with an adjoint, then T*T is a bounded positive 
operator, so, by Proposition 6.4.9, the absolute value of T, 

|Tj = VT*T, 

exists as a bounded positive operator. 
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Proposition 6.5.5. IfT is a bounded operator on H with an adjoint, then ran (|T|) 
is dense in the closure of ran (T*) . 

Proof. We see from Proposition 6.4.9 that the range of \JT*T is dense in the closure 
of the range of T*T. Reference to Lemma 6.5.3 completes the proof. □ 


We have already seen, in Lemma 6.5.2, that, for a jointed operator T, the kernel 
of T* is the orthogonal complement of the range of T, and that if ran (T) is located, 
then so is ker (T*) . If, conversely, ker (T*) is located, is ran (T) located also? Clas- 
sically it is, since the closure of ran (T) is the orthogonal complement of ker (T*) . 
However, if the latter holds constructively, then we can prove Markov’s principle: 
for if a is any real number such that -> (a = 0) , then the operator T : z az on C is 
selfadjoint and has kernel {0} ; but if ran (T) is located, then -> {p (1, ran (T)) > 0), 
so p(l, ran(T)) = 0, ran (T) contains nonzero elements, and therefore a^O. 

We say that a sequence (x n ) n>1 in H converges weakly to x £ H, and we write 

W 

x n — > x as n — > oo 

(or just x n x ), if linx^oo ( x n ,y ) = (x,y) for all y £ H. We call an operator 

T on H weak- sequentially open if for any sequence (x n ) n>1 such that Tx n — > 0, 

there exists a sequence (y n ) n > i hr ker(T) such that x n + y n 0. 

Proposition 6.5.6. IfT is a jointed operator on H such that ran(T*) is located, 
then T is weak-sequentially open. 

Proof. Let P be the projection of H on the closure of ran(T*). Let {x n ) n>l be a 
sequence in H such that Tx n — > 0, and for each n set y„ = Px n — x n . Then 
y n £ ran(T*) J - = ker(T). For each z £ H we have 

{x n + y n , T*z) = (x n ,T*z) - ((I - P)x n , PT*z ) = <: Tx n , z) , 


so 

I {x n + Un,T*z ) | < ||Tx n || ll^ll — + 0 as n — + oo. 

It follows that for each x £ H we have (x n + y n , Px) — > 0 and therefore 

( X n Pm X) {P ( X n -\- y n ) , x) — ( X n Pm Px') > 0. 

Hence T is weak-sequentially open. □ 


Theorem 6.5.7. Let T be a weak-sequentially open, jointed operator on H with 
located kernel. Then T* has located range. 
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Proof. Let P be the projection of H on ker(T). It suffices to show that for each 
x £ H, the vector x — Px is in the closure of ran(T*): for then 

\\ X -yf=\\( X -P X )+yf + \\P X f 

for all y € ran(T*), so p(x,ran(T*)) exists and equals ||Px||. Accordingly, fix x in 
H and e > 0. Denote the closed unit ball in H by B. By Theorem 6.3.4, T*(nB) is 
located in H for each positive integer n; so we can construct an increasing binary 
sequence (A n ) n>1 such that 

A ra = 0 => p(x — Px, T*(nB)) > |, 

A n = 1 => p(x — Px,T*(nB)) < e. 

Without loss of generality we may assume that Ai = 0. If A„ = 0, then by Corollary 
5.2.10 and the Riesz representation theorem, there exists a unit vector y n such that 
for each z £ nB, 

{. x-Px,y n ) > \(T*z, y„)\ + | = \(z,Ty n )\ + |. 

Taking z = n\\Ty n \\ 1 Ty n , we obtain 

| +n\\Ty n \\ < (x- Px,y n ) < ||a:|| 

and therefore ||Tj/ n || < ?r _1 ||x|| . On the other hand, if A„ = 1 — A„_i, we set 
yk = 0 for all k ^ n. Clearly, the sequence (Ty n ) n>1 converges to 0. But T is 
weak-sequentially open, so there exists a sequence (z n ) n>1 in ker(T) such that 

Un + z n 0- Choose N such that 

£ 

\(x - Px,y n ) | = |(x - Px,y n + z n )\ < - 

for all N. If Aat = 0, then \{x — Px, y n )\ > e/2, which is absurd. Hence \n = 1. 
Since £ > 0 is arbitrary, we are through. □ 


Corollary 6.5.8. If T is a jointed operator on H , then the following four state- 
ments are equivalent: 

(a) ran (T) is located. 

(b) ran(T*) is located. 

(c) ker(T) is located and T is weak-sequentially open. 

(d) ker (T*) is located and T* is weak-sequentially open. 

Proof. This follows from Lemma 6.5.2, Theorem 6.5.7, and Proposition 6.5.6. □ 
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Turning in a slightly different direction, we move towards a proof of the closed 
range theorem: 

Theorem 6.5.9. Let T he a jointed operator on H whose range is closed. Then 
ran (T) and ker (T) are located, and ran(T*) is closed. 

To set the scene, consider how the closed range theorem is proved classically. 
One simple proof uses the polar decomposition of the operator T *; but the existence 
of an exact polar decomposition in constructive analysis requires the range of the 
operator to be located from the outset (Exercise 16). In another classical proof of 
Theorem 6.5.9 the idea is to show that ran(T*T) is closed and then to apply Lemma 
6.5.3. To that end, let (x n )„j>i be a sequence such that (T*Tx n ) n ^ i converges to 
a limit y € H. Applying the classical uniform boundedness theorem (see page 163) 
to the bounded linear functionals f„ defined on the Hilbert space ran(T) by 

fn(Tx ) = ( Tx,Tx n ) = {x,T*Tx n ) , 

we obtain M > 0 such that ||/ n || ^ M for each n. Hence the linear functional 
Tx (x,y) on ran(T) is bounded by M. By the Riesz representation theorem, 
there exists x^ € H such that 

(x,y) = (Tx,Tx oo) = (x,T*Txoo) 

for all x. It follows that y = T*Tx ^ € ran(T*T). 

This proof fails constructively in two places: first, in its use of the classical 
version of the uniform boundedness theorem, and second, in its application of the 
Riesz representation theorem, which requires the linear functional to be not just 
bounded but normed. Fortunately, as the following sequence of results will show, 
these difficulties can be overcome. 

We begin with two lemmas that prepare us for a general result about sequentially 
continuous linear mappings between normed spaces. 

Lemma 6.5.10. Let T : X — > Y he a sequentially continuous linear mapping 
between normed spaces, (x n )„^i a Cauchy sequence in X, and 0 < a < /3. Then 
either ||Tx n || < (3 for all n or else there exists n such that ||Tx n || > a. 

Proof. In view of the linearity of T, we may assume that (3 — a > 1. Choosing a 
strictly increasing sequence {N k ) k ^\ of positive integers such that \\x m — x n || < 
2- 3k for all m.,n ^ N k , write 

Sfc = max {||Tx n || : 1 < n < N k } . 

Construct an increasing binary sequence (Afc)^^i such that 

A fc = 0 =► Vj ^ k ( 8j <13- 2" 2j ), 

A fc = 1 =► 3j ^ k ( Sj > (3- 2~ 2 3 +l ). 
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We may assume that Ai = A 2 = 0. Now construct a sequence (Afc)fc^i in X as 
follows. If Afc+i = 0 or Xk = 1, set Zk = 0. If Xk+i = 1 and A k = 0, then 

HTsjvJK Sk < P ~ 2“ 2fe 

and Sk+i > (3 — 2 _2fe_1 , so we can choose j such that Nk < j < iV^+i and ||Txj|| > 
/ 3 — 2 _2fc_1 ; setting 

Zk — 2 (Aj ) 1 

we have ||, 2 fc|| < 2~ k . Moreover, 

\\Tz k \\=2 2k \\Txj-Tx Nk \\ 

>2 2k (\\Tx j \\-\\Tx Nk \\) 

> 2 2k (/ 3 - 2~ 2k ~ 1 - (/? - 2 ~ 2k )) = 1 

This completes the construction of a sequence (zk)k^i converging to 0 in X. By the 
sequential continuity of T, lirri/^oc T Zk = 0. Choose K such that ||T^fc|| < 1/2 for 
all k ^ K. If A k = 1, then there exists n ^ Nk such that 

\\Tx n \\ >/3-2~ 2n+1 >a. 

On the other hand, if A^ = 0 and there exists k ^ I\ such that Afc+i = 1 — Afc, 
then ||Tzfc|| > 1/2, a contradiction. Thus if A k = 0, then Afc = 0 for all k ^ K and 
therefore for all k, so ||Ta;fc|| < (3 for all k. □ 


Lemma 6.5.11. Let T : X — > Y be a sequentially continuous linear mapping 
between normed spaces, and (x n ) n ^i a Cauchy sequence in X. Then sup ra>1 ||Tx n || 
exists. 

Proof. In view of the previous lemma, it is enough to show that the sequence 
(||Tx„||) ra>1 is bounded. To do so, choose R > 0 such that ||x n || < R for all n. 
Taking a = 1 and f3 = 2 in Lemma 6.5.10, we may assume that there exists rq 
such that ||Tx ni || > 1. Set Ai = 0. Using Lemma 6.5.10 repeatedly, we construct an 
increasing binary sequence (A„) n ^i, and an increasing sequence (nk)k^i of positive 
integers, such that 


Afc = 0 => ||Tx„J| > k and n k > n k - 1, 

Afc = 1 => ( Tx n ) n ^\ is a bounded sequence and rik + 1 = nk- 

Assume that we have constructed Afc and nk- If Xk = 1, we set Afc+i = Afc and 
rjfc + i = ilk- If Afc = 0, then ||Tx nj . || > j for all j ^ k. We then apply Lemma 
6.5.10 to the Cauchy sequence ( Xj)j>n k ■ Either we obtain Uk+i > nk such that 
||Tx nfc+ i|| > k + 1, or else ||Txj|| < k + 2 for all j > nk- In the first case we set 
Afc+i = 0, and in the second, noting that (Tx n ) n ^ 1 is bounded, we set Afc+i = 1 
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and Uk+i = nk- This completes the inductive construction of the sequences (Afc) fe>1 
and (rifc) fc>1 . If Afc = 0, set Zk — k~ 1 x nic ; if A k = 1, set Zk = 0. Then ||zfc|| ^ R/k 
for each k, so Zk — > 0 and therefore, by the sequential continuity of T, Tzk — » 0. 
Choose K such that ||T 2 fc|| < 1 for all k > K. If A a = 0, then 

\\Tz k \\ = i ||T* n J| > 1, 

a contradiction. Hence Aa = 1 and so the sequence (||Tx n ||) n>1 is bounded. □ 


Proposition 6.5.12. A sequentially continuous linear mapping T : X — > Y be- 
tween normed spaces maps Cauchy sequences to Cauchy sequences. 

Proof. Given a Cauchy sequence (x n ) n ^i in X , choose a strictly increasing sequence 
{Nkjk^i of positive integers such that \\x m — x n \\ < 2~ k for all m, n ^ Nk- For each 
fc, the sequence ( x n — XN k ) n >N k a Cauchy sequence; so, by Lemma 6.5.11, 

Sfe = sup \\Tx n -Tx Nk \\ 

n^N k 

exists. Given £ > 0, we construct an increasing binary sequence (Xk)k^i such that 

£ 

Afc = 0 => Sfc > -, 

£ 

Afc = 1 => Sfc < — . 

We may assume that Ai = 0. If Afc = 0, choose j ^ Nk such that \\Txj — TxN k \\ > 
e/4, and set Zk = Xj — XN k - If A k = 1, set Zk = 0. Then ||, 2 fc|| < 2~ k for each k, so 
Zk — * 0. Since T is sequentially continuous, Tzk — > 0 and we can choose K such 
that llT^fcH < e/4 for all k ^ K. If Aa = 0, then ||Tza|| > e/4, which is absurd; so 
A a = 1 and therefore sa < e/2. It follows that for all j, k ^ N K , 

\\Txj~TxkW s? \\Txj-Tx Nk \\ + || Tx k -Tx Nk \\ < A + - = e. 

Since e is arbitrary, {Tx n ) n ^i is a Cauchy sequence in Y. □ 


We now move more directly towards the proof of the closed range theorem. 

Lemma 6.5.13. Let H be a Hilbert, space, and T an operator on H with an adjoint 
and closed range. Let (x n ) n ^i be a sequence in H such that (T*Tx n ) n ^i converges 
to 0. Then for all positive numbers a, (3 with a < /?, either ||Ta; n || > a for some n 
or else ||Ta; n || < (3 for all n. 
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Proof. Let (rik)k^i be a strictly increasing sequence of positive integers such that 

\\ T * Tx A\ < ( k hy (j> n k)- 

Taking n 0 = 0, for each k ^ 1 set 

s k = max{||Ta;i|| : rik-i < i ^ n fc }. 

Construct an increasing binary sequence (X k )k^i such that 

Afc = 0 => s k < (3, 

Afe = 1 => Sk > a. 

We may assume that = 0. Define a sequence (y k ) k ^i hr ran(T) as follows: if 
A k = 0, set yk = 0; if Afc = 1 — Afc_i, choose i with n k - 1 < i ^ n k and ||Ta;j|| > a, 
and set 

Vj = W^f Xi 

for all j ^ k. Then (y k )k ^ l is a Cauchy sequence: in fact, | ijj — y k \\ < l/(k+ 1) 
whenever j ^ k. Since ran (T) is closed in H and therefore complete, there exists 
z £ H such that {y k )k ^ l converges to Tz. Choosing a positive integer N such that 
||z|| < Na, consider any integer k ^ N. If Afc = 1 — Afc_i, then 

Tz = W^\\ TXi 

for some i with n k -\ < i ^ n k and ||Txj|| > a, so 

ja < ^UTaJill = ( Txi,Tz ) = ( T*Txi,z ) < t^\\z\\ < T 2 Na ^ 

rZ rZ rZ rZ rZ 

a contradiction. Hence Afc = Afc_i for all k ^ N. It follows that either Afc = 0 for 
all k, or else Afc = 1 — Afc_i for some k < N. In the first case, ||Tx n || < (3 for all n; 
in the second, ||Ta:j|| > a for some i with n k - i < i ^n k . □ 


Lemma 6.5.14. Under the hypotheses of Lemma 6.5.13, for all positive numbers 
a, (3 with a < (3, either ||Ta: n || > a for infinitely many n or else ||Ta: n || < j3 for all 
sufficiently large n. 

Proof. Let (n k )k^i be a strictly increasing sequence of positive integers such that 

\\ T * Tx i\\ < ff2 (j> n k)- 

Successively applying Lemma 6.5.13 to the sequences (x nk ,x nk +i, . . .) for k ^ 1, 
construct an increasing binary sequence (Afc)fcj>i such that 
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Afc = o => 3?; ^ n k (a < ||Txj||), 
Afc = 1 => Vi ^ n*(||T’a;i|| < /?). 


We may assume that Xi = 0. Define a sequence {y k )k^i in ran(T) as follows: if 
Afc = 0, choose i ^ n k such that ||Txj|| > a and set 

Vk = W^f Xi ' 

if Afc = 1 — Afc_i, set y : j = y k -\ for all j > k. Then \\yj — y k \\ ^ 2/k for all j ^ k: so 
(y k ) k >i is a Cauchy sequence in the complete space ran (T) and therefore converges 
to Tz for some z £ H. Choosing a positive integer N such that ||;t|| < Na, consider 
any integer k > N . If Afc = 1 — Afc_i, then 


Tz = 


( k - 1 )|| Txi 

for some i ^ n k _ i with ||Txj|| > a , so 


rTXi 


YZi a < YZ\\\ TXi \\ = ( Tx ii Tz ) = ( T * Tx u z ) 
^ ttIUII < —Na < i -a, 


(fc-i r (fc-ir 


k- 1 


a contradiction. Hence X k = Afc_i for all k > N. It follows that either Afc = 1 for 
some k ^ N or else Afc = 0 for all k. In the first case, ||Txj|| < (3 for all i ^ n k . In 
the second case, for each k there exists i ^ n k such that ||Txj|| > a. □ 


Lemma 6.5.15. Under the hypotheses of Lemma 6.5.13, ( Tx n ) n>l converges to 0. 

Proof. By Lemma 6.5.14, for each e > 0 either ||Tx n || > e/2 for infinitely many 
n or else ||Tx„|| < £ for all sufficiently large n. In the former case, passing to an 
appropriate subsequence, we may assume that ||Tx n || > e/2 and ||T*Tx n || < 1/n 2 
for all n. Applying the uniform boundedness theorem (Corollary 6.2.12) to the 
normed linear functionals Tx ( Tx,nTx n ) on the Hilbert space ran(T), we can 
find Tz £ ran (T) such that the sequence (\(Tz,nTx n )\) n>1 is unbounded. But 

\(Tz,nTx n ) I = \(z,nT*Tx n )\ ^ — llzll — * 0 as n — > oo. 

n 

This contradiction rules out the possibility that ||Tx ra || > e/2 for infinitely many 
n. Since e > 0 is arbitrary, the result follows. □ 


Lemma 6.5.16. Let H he a Hilbert, space, and T a jointed operator on H with 
closed range. Then ran (T*T) is complete. 
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Proof. First observe that, by Theorem 6.3.6, both T and T* are sequentially con- 
tinuous. It is easily seen using Lemma 6.5.2 that T* is one-one on ran(T). So we 
can define a linear mapping 

S : ran (T*T) — > ran (T) 

by setting ST*Tx — Tx for each x £ H. Lemma 6.5.15 shows that S is sequentially 
continuous. Consider any Cauchy sequence ( T*Tx n ) n ^i in ran (T*T) . By Propo- 
sition 6.5.12, (Tx n ) n ^i is a Cauchy sequence in the Hilbert space ran(T). Hence 
there exists x £ H such that Tx n — > Tx. The sequential continuity of T* now 
yields T*Tx n — > T*Tx. Thus ran(T*T) is complete. □ 

We can now give the proof of the closed range theorem. 

Proof. By Lemmas 6.5.3 and 6.5.16, ran (T*T) is both dense in ran(T*) and com- 
plete. Hence ran(T*) is complete and therefore closed in H. Moreover, for each 
x € H there exists y £ H such that T*x = T*Ty\ so 

x = Ty + {x - Ty ) , 

where Ty £ ran (T) and (by Lemma 6.5.2) 

x — Ty £ ker (T*) = ran(T) _L . 

It follows from Lemma 6.5.2 that both ran (T) and ker (T*) are located. Applying 
Corollary 6.5.8, we now see that ran(T*) and ker (T) are located. □ 


6.6 Baire’s Theorem, with Applications 

Baire ’s theorem states that 

The intersection of a sequence of dense open subsets of a complete metric 
space is dense in that space. 

The standard classical proof of this theorem (see [79], page 97) passes over un- 
changed to the constructive setting. However, various classically equivalent versions 
of Baire’s theorem do not pass over unscathed; for example, the version that states 
that if a complete metric space is the union of a sequence of closed subsets, then 
one of those subsets is inhabited. In this section we present a constructive proof of 
a restricted form of this last version of Baire’s theorem in the context of a Banach 
space, and apply it to operator theory. 

The proof of our version of Baire’s theorem introduces yet another technique, 
in which we show that a certain property P holds by constructing an element of 
the set {x £ X : P}, which is empty if -<P holds and inhabited if P holds. 
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Theorem 6.6.1. Let X be a Banach space, and C a closed, convex, balanced, lo- 
cated subset of X such that X = (J nC and the distance p (0, —C) exists. Then C° 

n^l 


is inhabited. 


Proof. For each positive integer n define the open set 

U n = —nC U {x £ X : C° is inhabited} , 

where — nC is the metric complement of nC in X. To prove that U„ is dense in 
X, consider y £ X and £ > 0. Note that p{y,nC) exists and equals np ( n~ 1 y,C ) . 
Either p(y,nC ) > 0 or p(y,nC) < e. In the first case, y £ —nC. In the second, 
choose z € nC such that ||y — z\\ < e. Noting that 

p (0, —2nC) = 2np (0, —C) 

exists, we see that either p(0,— 2nC) < 2e or p(0, — 2nC) > e. In the former case, 
choose z' £ —2nC such that || 2 /|| < 2e. For each w £ nC we have — w £ nC (since 
C is balanced), so 

z — w £ nC + nC = 2 nC, 
by the convexity of C. Hence 

|| (z - z') - io|| = ||/ - (z-w ) || > p{z' ,2nC) > 0. 

Thus z — z' £ — nC . Since also 

\\y-(z-z')\\<\\y-z\\ + \\z’\\<3e, 

we see that p{y,U n ) < 3e. Finally, in the case p(0,— 2nC) > e, for each x with 
||x|| < £ we have p(x,2nC ) = 0. Hence 

5(0, £) C 2nC = 2nC, 

so (2 nC)° , and therefore C°, is inhabited. In this case, U n = X. This completes the 
proof that U n is dense in X. 

Since X is complete, it follows from the standard version of Baire’s theorem 
that P| U n is dense in X and therefore, in particular, contains a point f. Choose 

n^l 

n such that £ £ nC. Since also £ £ U n , we must have 

£ £ {x £ X : C° is inhabited} . 

Hence C° is indeed inhabited. □ 


Recall the classical open mapping theorem for bounded linear mappings: a 
bounded linear mapping T of a Banach space X onto a Banach space Y is open , in 
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the sense that it maps open subsets of X onto open subsets of Y (or, equivalently, 
it maps the open unit ball of X onto an open subset of Y). This theorem does 
not hold constructively without some additional hypotheses: the inverse S of the 
mapping T in the example on page 169 is a bounded operator of a Hilbert space 
onto a Hilbert space, but if S is open, then T is bounded. In other words, if every 
bounded linear mapping of a Hilbert space onto a Hilbert space is open, then BD-N 
holds. 

Nonetheless, we can prove constructive versions of the open mapping theorem 
that are classically equivalent to the standard version. To that end, we begin with 
a lemma. 

Lemma 6.6.2. Let T be a sequentially continuous linear mapping of a Banach 
space X into a normed space Y such that T{B X { 0, 1)) is located. Let r be a positive 
number, and y an element of By (0,r). There exists x £ B y(0,2) such that if 
y ^ Tx, then p (z, T (By (0, 1))) > 0 for some z £ By (0, r). 

Proof. If p ( y , T (Bx (0, 1))) > 0, then we can take x to be any element of By (0, 2). 
So we may assume that p (y, T (Bx (0, 1))) < r/2. Choosing x\ £ Bx { 0,2) such 
that || y — T (5X1) || < r/2 and therefore ||2 y — Tx i|| < r, we set Ai = 0. This is the 
first step in the inductive construction of an increasing binary sequence (A„) n>0 
and a sequence (x n ) n>0 of elements of B x (0,2) such that for each n ^ 1, 

> if A„ = 0, then 


n— 1 


and 


2 n ~ X y - Y 2 n - 1 ~ i Tx i J , T (B x (0, 1)) ] < - 

i= 1 / 

n 

2 n y - Y 2 ” _i Tn 


< r ; 


> if X n = 1 — A ra _i, then 


n— 1 


2 n ~ X y - Y Txi I , T (B x (0, 1)) > 0 


and Xi = 0 for all i ^ n. 


Suppose that we have found A„_i and a; n _i with the applicable properties. If 
A„_i = 1, we set A„ = 1 and x n = 0. If A„_i = 0, we consider the two cases 


n—1 


2 n ~ 1 y - Y 2 n ~ 1 ~ i Txi , T (B x (0, 1)) > 0 


and 
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p( [2 n - 1 y-Y J ^~ 1 ~ i Tx i \,T{B x {Q,l)) \ < V ~. 


i= 1 


In the first case we set A n = 1 and x n = 0. In the second case we choose x n £ 
B x (0, 2) such that 


and set A„ = 0. Then 


2 n ~ 1 y — 2 n ~ 1 ~ i 'Txi — T 

2= 1 

n 

2 n y - ^ 2 n ~ i Txi 


1 


< r. 


< 


This completes the induction. 

OO 

Since X is complete, the series ^2 2 converges to a sum x G (0, 2) . Then 

2=1 

OO 

Tx = 2 ~ l Txi , by the sequential continuity of T.li y ^ Tx, then there exists N 

2=1 

such that 

N 


- ]T 2-n^ 


>2 ~ N r 


and therefore 


N 


2 N y - ^ 2 N ~ i Txi 


> r. 


We must therefore have A at = 1; so there exists n ^ N such that A„ = 1 — A„_i. 
Setting 


= 2 n - 1 y-J2 2n ~ 1 ~ i Tx i , 


i=l 


we see that p (z, T {B x (0, 1))) > 0 (as A n = 1) and ||z|| < r (as A n _i = 0). This 
completes the proof. □ 


Lemma 6.6.3. Let C be a balanced convex subset of a normed space Y, and let 
■y &Y and r > 0 be such that B(y,r) C C. Then B(0,r) C C. 

Proof. If 2 £ Y and ||£|| < r, then y±z £ C and therefore 2 = \ (y + z) — \ (y — z) £ 
C. □ 


Here is our version of the open mapping theorem. 

Theorem 6.6.4. Let X, Y be Banach spaces, and T a sequentially continuous linear 
mapping of X onto Y such that T (13(0, 1)) is located and p (0, —T ( B (0, 1))) exists. 
Then T is an open mapping. 
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Proof. Since 

Y = U « (r (5(0,1))) , 

n^l 

we can apply Theorem 6.6.1 to compute yo £ Y, R > 0, and a positive integer N 
such that 

B y (y 0 ,R) c N (t (5(0,1))) . 

Writing y\ = N~ 1 y 0 and r = R/N, we obtain 

B Y (yi,r) c T (5 (0,1)). 

It follows from Lemma 6.6.3 that 

B y (0, r) CT(6 (0, 1)). (6.21) 

Now consider any y £ Y with \\y\\ < r. Choose x £ 5(0,2) as in the con- 
clusion of Lemma 6.6.2. If y ^ Tx, then there exists z € 5y(0,r) such that 
p (z, T ( 5 (0, 1))) > 0, which contradicts (6.21); hence (the inequality on a normed 
space being tight) y = Tx. Thus 5y(0,r) C T(5(0,2)), and therefore T is an 
open mapping. □ 


A subset C of a linear space X is said to be a generating set for, or to generate, 
X if every element of A is a finite linear combination of elements of C. Our next 
lemma will enable us to prove that compactly generated Banach spaces are finite- 
dimensional. 

Lemma 6.6.5. Let G be a compact generating set for a nontrivial Banach space 
X. Then there exists a compact generating set C for X that is both convex and 
balanced such that p(0,— C) exists. 

Proof. We may assume that G is both convex and balanced (the proof is left as an 
exercise). Now, 2 G is compact, the mapping x p{x,G) is uniformly continuous 
on X , and X is nontrivial. Hence there exists S > 0 such that both the sets 

C={x£2G:p(x,G)^6}, 

D = {x £ 2 G : p (x, G) ^ if} 

are compact. Note that C is convex and balanced, and, since it contains G, generates 
X. We show that — C is dense in D. To this end, consider any x £ D and any e > 0. 
Choose t > 1 such that (t — 1) ||a:|| < e/2, and suppose that tx £ C. Then, since C 
is balanced, x £ C; whence x £ C fl D and therefore p (x, G ) = 5. But then for each 
g £ G we have 

|| tx — g\\=t ||x — t~ 1 g || ^ tp (x, G) = tS; 

so p(tx,G) > tS > 5, which is absurd since tx £ C. We conclude that tx ^ C. It 
follows from Proposition 3.1.2 that there exists y £ ~G such that || tx — y|| < e/2 
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and therefore \\x — y | < e. Applying Bishop’s lemma (Proposition 3.1.1), we see 
that y £ —C. This completes the proof that — C is dense in D. Since the norm 
function is uniformly continuous on the compact set D , it now follows that 

p (0, —C) = inf {||a:|| : x £ —C} = inf {||x|| : x £ Dj 


exists. 


□ 


A normed space with a compact generating set is said to be compactly generated. 

Theorem 6.6.6. A compactly generated Banach space is finite- dimensional. 

Proof. Let A be a compactly generated Banach space. We first suppose that A 
contains a nonzero vector. By Lemma 6.6.5, X has a balanced, convex, compact 
generating set C such that p(0,—C) exists. Since X = (J nC, we can apply The- 

n^l 

orem 6.6.1 to show that C° is inhabited; whence C contains a nontrivial ball. But 
every ball in a normed space is located, so the ball in question is totally bounded. 
It follows from Proposition 4.1.13 that X is finite-dimensional. 

It remains to remove the restriction that X be nontrivial. To do this, we work 
in the product Banach space Ixi This space is generated by the compact set 
G x {1}; so, by the foregoing, X x IK is finite-dimensional. It follows that A, being 
isomorphic to the quotient space (A x K)/K, is finite-dimensional. □ 


Recall from page 102 that a linear mapping T : X — > Y between normed spaces 
is said to be compact if T{B y(0, 1)) is a totally bounded subset of Y. 

Corollary 6.6.7. If T is a compact linear mapping of a normed space X onto a 
Banach space Y, then Y is finite- dimensional. 

Proof. The totally bounded set T (B( 0, 1)) generates Y. Since Y is complete, it fol- 
lows that T (B( 0, 1)) is a compact generating set for Y\ whence, by Theorem 6.6.6, 
Y is finite-dimensional. □ 


In the traditional development of functional analysis, the open mapping theorem 
is used to prove Banach’s inverse mapping theorem, the closed graph theorem, and 
the uniform boundedness theorem. The last of these three we have already discussed, 
in Section 2. To deal with the inverse mapping theorem we need another lemma 
about convex sets, and two further technical lemmas. 

Lemma 6.6.8. Let C he a convex, absorbing subset of a Banach space X. Then 

0?^C. 
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Proof. Assuming that 0 £ —C, we first show that X — nC is dense in X for each 
positive integer n. To do so, construct a sequence (x n ) n>1 in — C that converges 
to 0. Fixing a positive integer n, an element y of X, and £ > 0, compute positive 
numbers r,8 and a positive integer k such that —y £ rC, ||xfc|| < e/ (n + r), and 
\\x k — z\\ > 6 for all z £ C. Let 

yi=V+(n + r) x k . 


Then 


l|y-yi|| = (n + r) ||xfc|| < £. 


On the other hand, since C is convex, for each 2 £ nC we have 


(z - y) £ (nC + rC ) = C, 


so 


hi - 4 


(n + r ) 


Xk 


1 

n + r 


(z-y) 


^ (n + r) 6. 


Hence y\ £ — nC . Since £ > 0 is arbitrary, it follows that — nC is dense in X. 
Moreover, being a metric complement, — nC is open. Applying the standard form 
of Baire’s theorem, we now see that (") — nC is inhabited, which is absurd since 

n^l 

X = U nC. We conclude that 0^—0. □ 

n^l 


Sometimes when we want to prove that a certain proposition P is absurd, we 
first prove that P implies LPO, and then (frequently by adapting a classical proof) 
show that the addition of LPO to our intuitionistic logic suffices for us to prove 
that P is false. The next lemma will enable us to rule out in this way an unwanted 
alternative in the proof of Banach’s inverse mapping theorem. 

Lemma 6.6.9. Let f be a strongly extensional mapping of a complete metric space 
X into a metric space Y. Suppose that there exist a > 0 and a sequence (x n ) n>1 
converging to x in X such that p (f(x), f(x n )) > a for each n. Then LPO holds. 

Proof. Given an increasing binary sequence (A n ) n>1 with Ai = 0, construct a se- 
quence (z n ) n>1 in X such that 

• if A k = 0 for all k ^ n, then z n = x, and 

• if A n = 1 — A„_i, then z k = x n for all k ^ n. 

Then (z n ) n>1 is a Cauchy sequence in X and so converges to a limit z £ X. Either 
f(x) ^ f(z) or else p (f(x),f(z)) < a. In the first case, the strong extensionality of 
/ shows that x ^ z\ whence there exists n such that x ^ z n and therefore A„ = 1. 
In the second case, if there exists n with A n = 1 — A n _i, then we obtain the con- 
tradiction p (f(x),f(z)) = p (f(x),f(x n )) > a; hence A„ = 0 for all n. □ 
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Lemma 6.6.10. If LPO holds, then every separable subset of a metric space is 
located. 

Proof. Assuming LPO, consider a separable subset S' of a metric space X. Let 
(s„) n >i be a dense sequence in S, let x € X, and let 0 < a < (3. Construct a binary 
sequence (A„) n>1 such that 

A„ = 0 =t- Vfc < n ( p{x , s k ) < \ (a + /?)) , 

A „ = 1 3k < n ( p(x , s k ) > a) . 

By LPO, either A ra = 0 for all n, in which case, since (s n ) ra>1 is dense in S, 
p(x, s ) < \ (a + (3) < (3 for all s € S; or else there exists n with A n = 1 and 

therefore p(x , Sk) > a for some k ^ n. It follows from the constructive least-upper- 

bound principle that p ( x , S) exists. □ 


This brings us to Ishihara’s version of Banach ’s inverse mapping theorem. 

Theorem 6.6.11. Let T be a one-one, sequentially continuous linear mapping of 
a separable Banach space X onto a Banach space Y. Then T^ 1 is sequentially 
continuous. 

Proof. Let ( x n ) n>1 be a sequence in X such that Tx n — > 0, and let e > 0. By 
Corollary 3.1.7, the inverse linear mapping T _1 : Y — > X is strongly extensional; 
whence, by Lemma 3.2.2, either ||x„|| < e for all sufficiently large n, or else ||x„|| > 
e/2 for infinitely many n. It suffices to rule out the latter case. To do so, we may 
assume that ||x n || > e/2 for all n. By Lemma 6.6.9, LPO holds; so, by Lemma 
6.6.10, every separable subset of Y is located. Let {a n ) n>1 be a dense sequence in 
.By (0,1). Since T is sequentially continuous, (Ta„) n>1 is dense in T (Bx(0, 1)), 
which is therefore located. Writing x' n = 2e _1 x„, we see that \\x' n \\ > 1 and that 
Tx' n — > 0 as n — > oo. Since T -1 is strongly extensional, 

Tx'„e~T (B A -(0,1)). 

We now apply Lemma 6.6.2 with y = T(2x' n ), to produce z n £ X such that 
\\Tz n \\ < WTx'J+n- 1 and p (T (2z n ) , T (~B X (0, 1))) > 0. 

Then 

p (Tz n ,T (B x (0, A))) > 0 and Tz n — ► 0. 

Hence 

0 € -T (B x (0, §)). 

This contradicts Lemma 6.6.8, since T (Bx (0, |)) is both convex and absorbing. □ 
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Recall that the graph of a mapping T : X — > Y is the set 
S(T) = {{x,Tx) :xeX}. 

As classically, Banach’s inverse mapping theorem leads to a version of the closed 
graph theorem: 

Corollary 6.6.12. Let T he a linear mapping of a Banach space X into a Banach 
space Y such that Q (T) is closed and separable. Then T is sequentially continuous. 

Proof. The mapping p : ( x,Tx ) x of the Banach space Q (T) onto X is one-one 
and bounded linear. It follows from Theorem 6.6.11 that the inverse linear map is 
sequentially continuous, and hence that T is sequentially continuous. □ 


In the theory of unbounded operators, the graph plays a significant role. Par- 
ticularly important properties for such a graph are closedness and locatedness. See 
[26, 83, 92] for more on such matters. 


Exercises 

1. Complete the proof of Proposition 6.1.1. 

2. Complete the proof of Proposition 6.1.4. 

3. Prove that a bounded linear mapping T of a normed space X into C" is compact 
if and only if / o T is normed for each linear functional / on C n . Prove that 
if also the sum of any two normed linear functionals on X is normed, then T 
is compact if and only if pk o T is normed for each k, where pk denotes the 
mapping (zi,...,z n ) z k on C”. 

4. Let S' be a compact operator on a Hilbert space H, and A a bounded operator 
on H. Prove that 

(a) AS is compact for each A € C; 

(b) S* exists and is compact; 

(c) AS is compact. 

Prove also that if A* exists, then SA is compact. 

5. A subset S of a normed space X is said the be weakly totally bounded if it is 
totally bounded relative to the locally convex structure defined on X by the 
seminorms x |/(x)|, with / a normed linear functional on X. Prove the 
equivalence of the following conditions on X. 
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(a) The sum of any two normed linear functionals on X is normed. 

(b) The unit ball of X is weakly totally bounded. 

(Hint: To prove that (a) implies (b), note the second part of Exercise 3.) 

6. An operator on a Hilbert space H is said to be weakly compact if it maps the 
unit ball of H to a weakly totally bounded subset of H. Prove that an operator 
T on H has an adjoint if and only if it is weakly compact. 

7. Let (e n ) n>1 be an orthonormal basis of a separable Hilbert space H } and T an 
operator on H. Prove that the following conditions are equivalent: 

(a) T has an adjoint. 

(b) |(Te„,efc}| converges for each k. 

n— 1 

°° 2 

( c ) S \(Te n ,y)\ converges for each y £ H. 

n—1 

8. Let T be a weak-sequentially open operator on a Hilbert space such that ker ( T ) 
is located. Prove that T is well-behaved (see Exercise 3 of Chapter 3). 

9. Prove that every bounded linear mapping of a normed space onto a finite- 
dimensional Banach space is an open mapping. 

10. Show that the statement “every normed linear mapping T of a Hilbert space 
onto a finite-dimensional Banach space is compact” implies an omniscience 
principle. 

11. Show that the statement “every one-one compact linear mapping T of a Hilbert 
space into a finite-dimensional Banach space has finite-dimensional range” im- 
plies an omniscience principle. 

12. Show that the existence of the norms of the operators can be removed from the 
hypotheses of Royden’s version of the uniform boundedness theorem (Theorem 
6 . 2 . 11 ). 

13. Complete the details of the proof of Proposition 6.4.6. 

14. A partial isometry is a jointed operator U on a Hilbert space H for which there 
exists a projection P, called the initial projection of U, such that ||Lhr|| = ||a:|| 
for all x £ ran (P), and Ux = 0 for all x £ ran (P) ± - Prove that U*U = P, that 
UU* is a projection, and that ran (UU*) = ran (U) . 

15. Let T be a bounded operator such that T* exists and has located range, and let 
P be the projection of H on ran (T*). Prove that there exists a partial isometry 
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U on H whose initial projection is P such that T = U \T\ and |T| = U*T. Prove 
also that UU* is the projection of H on ran (T). (The expression of T as U \T\ is 
called the polar decomposition of T, and is analogous to the modulus-argument 
form of a complex number.) 

16. Prove that if a bounded jointed operator T on a Hilbert space H has a polar 
decomposition, then ran (T) is located. 

17. Prove the converse of Proposition 6.5.12: if a linear mapping T between normed 
spaces maps Cauchy sequences to Cauchy sequences, then T is sequentially 
continuous. 

18. Let E be a dense linear subspace of a normed space X, and T a sequentially 
continuous linear mapping of E into a Banach space Y. Show that T extends 
to a sequentially continuous linear mapping of X into Y. 

19. Prove that if G is a compact generating set for a Banach space X, then there 
exists a compact generating set G' for X that is balanced and convex. 

20. Let G be a balanced convex generating set for a normed space X. Prove that for 
each x £ X and each e > 0 there exist t > 0 and g £ G such that \\x — tg\\ < e. 
Can we replace £ by 0 in this result? 


Notes 

Many of the results in this chapter come from papers by Ishihara and the authors. 
The work of Sections 1 and 2 is largely drawn from [61] and [63]. For related classical 
material see [86]. It is not known whether Theorems 6.2.1 and 6.2.2 can be extended 
to possibly unbounded convex subsets of a normed space. More general versions of 
Exercises 3-6 appeared in Ishihara’s thesis [56]. 

Since we cannot guarantee that a bounded operator on a Hilbert space has an 
adjoint, when we discuss such matters as the Gelfand representation theorem for 
an operator algebra 21, we need to postulate that 21 is selfadjoint in the sense that 
each of its elements has an adjoint that also belongs to 21; see [9, 12]. 

The principle BD-N, introduced by Ishihara in [59], has the unusual feature of 
being provable classically, intuitionistically, recursively, but not, apparently, within 
BISH. It is an interesting problem — a part of constructive reverse mathematics 
[7, 62] — to identify classical theorems that are equivalent to BD-N. 

The usual classical proof of Lemma 6.4.7 is based on the nonconstructive 
monotone convergence theorem for sequences; see [47] (7. 3. 1.1). The constructive 
proof is much more informative, in that it provides the rate of convergence of the 
sequence of functions. A full constructive analysis of Dini’s theorem is given in [8]; 
see also [21]. 
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The existence of the square root of a selfadjoint operator is a special case of a 
more general result, the spectral theorem for sequences of commuting selfadjoint 
operators, which enables us to construct more general functions of an operator. 
Since that theorem requires measure theory, which we do not touch in this book, 
we refer the reader to the relevant chapters of [9] and [12]. 

It is easy to prove Lemma 6.5.3 classically, taking orthogonal complements in 
the identity ker(T*T) = ker(T*). 

For more on polar decompositions see [35]. 

Proposition 6.5.12 is trivial in CLASS, since in that context sequential continuity 
for linear maps implies uniform continuity. 

With classical logic, Theorem 6.6.1 is a simple consequence of the standard 
form of Baire’s theorem, the hypotheses of locatedness and the existence of p( 0, — C) 
being redundant. For other, constructively inequivalent, versions of Baire’s theorem, 
see Chapter 2 of [34]. 

For more on open mapping theorems see [29] and [25]. It is interesting that se- 
quential continuity, rather than boundedness, is the best we can get in the construc- 
tive versions of Banach’s inverse mapping theorem and the closed graph theorem. 
An extension of the former is given in [64] . 



What can be said at all can be said clearly; 

and whereof one cannot speak thereof one must be silent. 

— Ludwig Wittgenstein, Tractatus Logico-Philosophicus 
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The primary historical reference on constructive analysis is [9] , the review of which 
[85] is interesting in its own right. Later references for Bishop-style constructivism 
are [12, 34], the latter of which gives comparisons between BISH, INT, and RUSS. 
Beeson [6] and Troelstra-van Dalen [88] contain a wealth of information about the 
logic, philosophy, and practice of constructive mathematics. For some applications 
of constructive mathematics, see [20, 33, 92]. The definitive reference for construc- 
tive algebra is [72], but [49] should be consulted for more recent work in the field. 

The classic work on intuitionism is [48] . The life and works of Brouwer himself are 
discussed in [44, 45, 84]. Martin-Lofs early work on constructive mathematics is 
found in [68], and his theory of types appears in [69]. 

Among the most recent varieties of computable analysis is that of Weihrauch [91]; 
the translation of BISH into Weihrauchs framework is described in [5] . 
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pointwise, 15 
polar decomposition, 199 
polarisation identity, 175 
positive operator, 175 
power set, 19 
product, 34 
product metric, 37 
product metric space, 37 
product norm, 50 
product normed space, 50 
projection, 94 
proximinal, 88 
pseudobounded, 169 
Pythagoras’s theorem, 95 


recursive constructive mathematics, 6 
regular, 60 
represented, 99 

Riesz representation theorem, 99 
Riesz’s lemma, 86 
RUSS, 6 

selfadjoint, 102 
seminorm, 48 
seminorm, defining, 127 
separable, 38, 129 
separation theorem, 121 
sequence, 15 

sequentially continuous, 21, 39 

sequentially discontinuous, 77 

sequentially nondiscontinuous, 72 

sequentially open, 181 

series, 56 

set, 12 

set, basic, 18 

smooth, 148 

span, 81 

spanned, 81 

spectral theorem, 200 

square root, 64 

square root, operator, 179 

square summable, 93 

strong injection, 45 

strongly continuous, 44 

strongly extensional, 14 

strongly injective, 45 

strongly unique, 14 

subset, 12 

subspace, 50 

sufficiently large, 71 

sum, 33 

sup norm, 49 

supremum, 32, 40 


quantifiers, 7 
quasiproximinal, 88 
quotient norm, 55 
quotient space, 55 

range, 15 

rational numbers, 25 
real line, 27 
real number, 25 
real part, 36 


termwise, 15 
tight, 13 
topology, 37 

totally bounded, 38, 129 
trichotomy, 29 
Type-2 Effectivity, 22 

ultrafilter principle, 146 
ultraweakly continuous, 145 
unequal, 13 
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unequal real numbers, 26 

uniform boundedness theorem, 165 

uniform Cauchyness, 49 

uniform convergence, 49 

uniformly almost located, 76 

uniformly continuous, 40, 128 

uniformly convex, 54, 126, 151 

uniformly Frechet differentiable, 148 

union, 19 

unit ball, 48, 127 

unit kernel, 132 

unit vector, 48 

upper bound, 32 


upper order located, 32 
value, 14 

weak star topology, 130 
weak-sequentially open, 182 
weakly compact, 198 
weakly discriminating, 77 
weakly totally bounded, 197 
Weihrauch, 6 
well-behaved, 76 
WLEM, 58 

Zorn’s lemma, 116, 122 
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